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Preface 


There are already a number of excellent textbooks which cover the subject of particle accel- 
erators, so why have we decided that there is a need for yet another one? The motivation 
comes from our experience at the Cockcroft Institute of supervising and teaching scientists 
and engineers who are new to the subject. We have found that schools, lectures, and text- 
books are great at passing on the fundamentals — which are absolutely core to our subject 
— but that it is the more practical side of particle accelerators that our staff and students 
sometimes fail to connect with. The aim of this book is therefore to not just explain the 
principles that underpin our field, but to also pass on some of the experience that we all need 
to design, build, and operate these marvellous machines. Many of the things we highlight 
are straightforward to convey, but before they are actually pointed out they can appear a 
little mysterious. We hope this book will give some useful guidance. 

Over the last ten years or so, the Cockcroft Institute of Accelerator Science and Technol- 
ogy in the UK has built up a teaching programme that encompasses the range of skills and 
methods required to design, construct, and operate particle accelerators across the range 
of uses to which they can be put. This range covers: the research required to understand 
and improve particle acceleration methods; the ‘traditional’ uses of accelerators in particle, 
nuclear, and atomic physics; and the applications outside of academic research in such areas 
as medicine, security, and industry. This book tries to reflect that scope; although we cannot 
cover all the myriad topics our rich subject has to offer, we have tried to cover enough topics 
to give a core understanding. 

Whilst there are a number of excellent reference guides on the subject — Chao and 
Tigner’s excellent handbook being a notable one, since it is found on the desk of nearly ev- 
ery accelerator scientist we have met — we found it difficult to find a practical introduction 
to the key topics and calculations an early-career professional in our field might encounter. 
That is the motivation for our textbook: as well as developing a number of topics in accel- 
erator physics (radio-frequency and other acceleration technology, magnetic design, beam 
dynamics, and radiation), we hope also to provide guidance on how to correctly apply those 
ideas, and in what practical situations different calculations ought to be used. This book 
therefore includes numerous worked examples that show the typical numerical quantities 
that may be encountered. Exercises are also included for the reader on key points, and these 
can be found at the end of the chapters. The solutions to all the exercises are freely available 
to download from the publisher’s web site page for this book. We have tried to make this 
book fall somewhere between a traditional textbook and a handbook of formulae. 


We hope you enjoy reading it as much as we enjoyed writing it. 
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An Invitation: Acceleration! 


This book is about particle accelerators, what they accelerate, how they work, and how 
(and why) we build them. In this opening chapter we shall take a short tour of a typical 
accelerator, moving along the beamline and making sense of the different kinds of accel- 
erator elements. This will give a kind of map for the rest of the book and a map of the 
subject. We hope the reader finds it useful for an overview and orientation. We are scientists 
and engineers, and as such are concerned with the observation and understanding of the 
physical world. The first step to any kind of deeper — and if we are lucky, quantitative — 
understanding of that world is to group and classify those aspects attracting our attention. 
How do we classify particle accelerators? Before we start, we should first define what a 
particle accelerator is. 

We may define a particle accelerator as a device — often called a ‘machine’ — that endows 
subatomic particles with large and variable amounts of kinetic energy. ‘Large’ here is in 
comparison with the sorts of energies one obtains from a particle source such as a simpler 
electron gun or ion source that might produce particles of tens of thousands of electron- 
volts (eV).* Particle accelerators differ from other sources of energetic particles — such as 
radioactive decay — in that an accelerator allows us (more or less) to freely choose the 
particle energy; for example, alpha particles from a given radionuclide — say, americium- 
241* — are emitted with only a single energy (of several MeV). We will see in the next 
chapter that electric fields are the predominant method of providing a particle with kinetic 
energy, and this demands that the particles we accelerate are charged so as to experience 
an acceleration from that field; the beams of particles that travel through an accelerator are 
therefore often described in terms of the equivalent current they carry. However, there are 
also so-called secondary sources of particles, some of which may be electrically neutral; three 
important examples are the photon, the neutron and the neutrino, all commonly produced 
by accelerators and used extensively in science and engineering for quite different things. 

In the following chapters we will deal with the manner in which particles are produced, 
accelerated and used — each of which of course depends on the particular particle. But first 
let us take an overview, and attempt to classify them by type. A first observation is that 
some accelerators are straight (i.e. linear), in which the accelerated species pass through 
each element of the accelerator only once; often the predominant element is the one that 


*The electron-volt is generally the most appropriate unit to quantify that kinetic energy. 
*Americium-241 is chosen as an example here because it is the most commonly-encountered radioisotope; 
around 1 Curie activity (about 0.3 mg of AmOz2) is present in virtually every domestic smoke detector. 
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performs the acceleration. These are usually called linear accelerators, or linacs for short. 
The alternative is the circular accelerator, in which particles circulate many times, very often 
repeatedly through the same elements; this can allow the re-use of accelerating elements 
(such as accelerating gaps or cavities), or allow repeated production of some secondary 
species — such as photons, for instance — from the same primary particle. 

So we can classify accelerators into two broad categories — those that accelerate particles 
in a straight line and those that accelerate particles approximately in a circle (usually called 
a ring). In the straight (or linear) type, the particles start at one end and pass through 
every element only once (including the accelerating elements), finishing up at the end of the 
accelerator. This type of linear accelerator (usually abbreviated as ‘linac’) is very common 
and is used all over the world, mostly commonly as the device that supplies electrons 
at ~10 MeV in an X-ray radiotherapy machine. To construct the second type, we imagine 
bending our linear accelerator into a ring using dipole magnets (also therefore called bending 
magnets), so that the particle makes many laps (or turns) of the ring, also passing through 
the elements making up that ring many times. A widely-encountered example of this type 
of accelerator is the synchrotron; many synchrotrons are used today to produce high-energy 
photons (with energies typically of a few keV or more) by bending the circulating beam 
of electrons; these photons are then used in a variety of techniques by researchers. Other 
synchrotrons are used to accelerate — for example — protons to high energies to hundreds of 
GeV or more to undergo collisions to study particle physics. 

If we visit a particle accelerator (‘accelerator’ for short) we find that many are composed 
of several distinct systems, each of which is commonly regarded as an accelerator in its own 
right. A famous example is CERN’s Large Hadron Collider (LHC), a proton accelerator that 
lies many metres under the ground near Geneva, and which is large enough, with a 27 km 
circumference, to cross the Swiss-French border twice! The LHC facility is really a number of 
connected accelerators, and the protons begin their lives within a bottle of hydrogen gas. An 
ion source is used to strip the electrons from the hydrogen atoms and deliver the protons at 
some (modest) energy of 70 keV to a pre-injector; a chain of further accelerators, first linacs 
and then circular synchrotrons, progressively increases each proton’s energy to a final value 
of 6.5 TeV (tera-electron-volts).* Two independent beams of protons of the same energy 
travel around the ring, one clockwise and the other counter-clockwise, and these are made 
to collide head on into each other at specific locations within the storage ring to produce 
reaction products useful for experiments in fundamental particle physics.* Another example 
is the free-electron laser (or FEL). Here, electrons are generated from an electron gun (a 
cathode from which electrons are emitted in the presence of a strong voltage, sometimes 
with some assistance from a short pulse of laser photons) and then progressively accelerated 
by a linac; these electrons then pass through a special magnetic device that prompts the 
electrons to emit light with laser-like properties (the FEL proper) and so generate tailored 
pulses of photons. An example of an electron gun is shown in Fig 1.1. 

The basic building blocks of any accelerator are: the devices that generate the particles 
(the sources); the devices that accelerate the particles, which is almost always done with 
electric fields; and the devices that confine and control the particles, which are commonly 
built using magnets. For example, electromagnets are used to deflect (bend) particles into 
a curved path — so-called dipole electromagnets are used to construct a circular accelerator. 
Other electromagnets such as quadrupoles, sextupoles and so on are used to confine particles 


*7 TeV is the anticipated energy in the future. 
*A storage ring is a type of synchrotron, but one in which the energy of the circulating particles is 
constant. 
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FIGURE 1.1 Here we see one of the electron guns at CLARA, situated at Daresbury Laboratory. This 
is a typical photo-injector source, in which a pulsed laser is directed (from the left) onto a cathode (on the 
right of the photograph) and produces an intense, short-duration bunch of electrons that contains tens of 
picocoulombs of charge. The electrons are then accelerated to the left by a strong oscillating electric field 
produced in several coupled cavities, up to a kinetic energy of several MeV. ©STFC 


into some desired size (envelope). All these devices generate a predetermined magnetic 
field that is experienced by the passing charged particles, using some sort of beam-optical 
arrangement; an example is shown in Fig 1.2. 

The effect of the electric and magnetic fields can be summarised by a single basic law 
that is the most important equation encountered in this book, and in the field of particle 
accelerators — the Lorentz equation (also known as the Lorentz force law). This describes 
the force F on a particle with charge q from an electric field E and magnetic field B and is 
given by 

F=q(E+vx B), (1.1) 


where v is the particle velocity. The consequences of this seemingly-simple equation — com- 
bined with the other laws of electromagnetism — will occupy us in the following chapters, but 
straight away we seem two very important differences between the way electric and mag- 
netic fields act upon charged particles; electric fields can perform work upon the charges, 
and therefore can impart (kinetic) energy to them, whereas magnetic fields produce a force 
at right angles to a charge’s motion and so do no work. A static magnetic field cannot change 
the energy of a charged particle, and particularly can do nothing at all if the charge is sta- 
tionary; we will discuss this more in the next chapter. We also note now the very important 
consequence of special relativity, which is that our accelerated particles often significantly 
increase in mass rather than velocity as they gain energy. This is always considered when 
calculating the effect of the Lorentz force, and obviously relativity ultimately determines 
that our particles cannot travel faster than the speed of light, c. Surprisingly, the ideas of 
quantum physics do not normally have to be considered, although on occasion we will; this 
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FIGURE 1.2 Part of the ALICE energy-recovery linac electron beam transport system, previously in- 
stalled at Daresbury Laboratory. Individual beamline magnets are typically mounted onto a girder (usually 
steel), adjusted so that their magnet centres are aligned and then fixed (with position accuracy of some 
tens of um); the girder may then be lifted into its operating position without significant relative movement 
of the magnets, and they may be adjusted together for efficiency. A typical alignment accuracy from girder 
to girder of a few tens of um is also commonly achieved. Q@STFC 


is most often encountered in the context of photon emission from charges. An important 
but sometimes overlooked aspect of particle accelerators is that that the beam pipe through 
which the particles pass (and which the electromagnets surround) must be evacuated to 
allow the particles to pass by with little scattering or absorption; residual gas within the 
vacuum system can give rise to such undesirable phenomena as particle loss, emittance 
degradation (blow-up), ion trapping by electron beams and the analogous electron cloud 
instability experienced by proton beams. 

The most important magnetic devices are the dipole and the quadrupole. Let’s look at a 
dipole first, which can be seen in Fig 1.3, which induces charged particles to follow a curved 
path (the arc of a circle to be exact); it bends a beam of particles and is composed of two 
poles (north and south). Linacs often need to utilise dipoles to produce some defined bend 
angle, for example to steer a produced beam to a precise final location; of course, a circular 
accelerator requires 360°-worth of total bend, and this is typically achieved using a number 
of dipoles each contributing a part of the overall deflection. The cyclotron is an example of a 
circular accelerator that utilises only one dipole, in the form of a single circular magnet. In 
a dipole a combination of current-carrying copper coils and steel poles produces an almost 
uniform magnet field, bending the passing charges through some angle determined by the 
magnetic field and each particle’s momentum. This deflection angle 0 is proportional to the 
field strength B; for very large values of B the coil currents must be large, and this may 
require them to be superconducting. As a general guide, ordinary electromagnetic dipoles 
generate fields up to 1 or 2 T, and superconducting dipole magnets typically generate fields 
up to ~8 T (with the prospect for significantly higher fields than this in the future). In 
Chapter 4 we will see how magnet designers construct dipole magnets to some specified 
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FIGURE 1.3 A dipole magnet used to deflect (bend) a beam of charged particles. A current-carrying 
coil — here made of copper channels wound a number of times around each of the two pole pieces — drives 
the magnetic field. The outer yoke closes the magnetic field lines to maximise efficiency and to limit the 
stray field away from the magnet so the magnetic field is essentially only present in air between the north 
and south poles. A vacuum vessel between the poles follows the 60° deflection angle, but also includes extra 
pieces (here with temporary flanges on) that allow for other uses such as vacuum pumping or for emitted 
light to be extracted and utilised. Q@STFC 


strength and field accuracy and we’ll see in t how we can compute the motion of 
particles in a (perhaps non-ideal) dipole field. 

The quadrupole — the ‘four-pole’ magnet — is often the most numerous type of magnet 
found in an accelerator, and can be seen in F . The magnetic field inside the aperture of 
a quadrupole has a strength B, « x (the vertical field rises as one moves horizontally from 
the magnet centre) and also B, «x y; a quadrupole provides a gradient g = 0B/Ox in the 
field with zero field at the magnet centre, so on average provides no deflection at all. As a rule 
of thumb we typically use gradients of 10 to 100 T/m; smaller magnet apertures make larger 
gradients easier to achieve. The purpose of quadrupoles is to focus and so basically to confine 
the beam to within some stable envelope, and in we will discuss Hill’s equation 
and how it determines if an arrangement of quadrupoles — called a magnetic lattice — gives 
a stable focusing channel. Often we use a matrix formalism, based on Hill’s equation, which 
allows us to follow — or track — the paths of individual particles. We will see in t how 
the Courant-Snyder formalism can be used to describe the envelope around those particles 
using the so-called -function and the other Twiss functions. We will also discuss higher- 
order magnets with more poles, such as the sextupole (in Ch and 5); these are 
commonly used to correct beam-optical aberrations and thereby enable magnet lattices to 
give better stability to the transported particles. Most particle accelerators require magnets 
that generate these higher-order fields. 
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At the heart of any accelerator are the devices that generate the accelerating fields, which 
we will see in Chapter 2 can only be electric fields. A common device is the accelerating 
cavity, within which a time-varying (oscillating) electric field is generated by means of 
some source of electromagnetic energy. Time-varying fields are often employed as they can 
more conveniently deliver large electric fields to the particles, albeit with the restriction 
that we then need to correctly time when those particles pass through the cavity. Particle 
beams are therefore very often bunched so that each bunch arrives within the cavity at the 
correct phase to see an accelerating field (other phases provide either less acceleration, no 
acceleration or will decelerate the particles). The fields in the cavity oscillate in time and 
obey the wave equation; we’ll see in Chapter 3 that the longitudinal electric field accelerates 
particles in the accelerating mode. Note this oscillation in time at a specific frequency and 
with a characteristic spatial field is typical of resonant structures. Accelerating electric fields 
are often measured in terms of MV/m; an example of a cavity that makes this sort of field 
is shown in Fig 1.4. 

To generate a large field, cavities are generally resonators that store electromagnetic 
energy, and they can be described also in terms of the voltage gain (which is proportional 
to the energy gain) a charge sees as it crosses from one end of the cavity to the other; the 
voltage is the integral of the peak accelerating electric field E, modified by what oscillation 
phase a particle sees as it travels through that field. Much of modern-day development of 
cavities and other kinds of accelerating structures is to achieve the highest possible gradient; 
a larger gradient generally means a smaller — and therefore cheaper — accelerator. Modern- 
day cavities seek to provide gradients as high as ~100 MV/m or more, depending upon 
how they operate and for which application; superconductivity is again often employed to 
limit ohmic energy losses in the body of the cavity and thereby to enable more efficient 
accelerators or to achieve parameters that would otherwise not be possible. However, some 
types of accelerators — notably cyclotrons and synchrotrons — do not need such high gradients 
as they can use a smaller voltage repeatedly; in this case, the focus can be more on efficiency 
and limiting power losses. Large accelerating fields of perhaps 1 GV/m or more can be 
produced when a plasma has induced within it a significant separation of the electrons from 
the positive ions; this can be brought about by a variety of means, including either the 
strong electric field from a focused laser pulse or the passage of a particle bunch. This is an 
active area of current research, and we will mention it briefly. 

The primary particles produced by accelerators are often used directly: for example, the 
LHC collides very high-energy protons upon each other after accelerating them, whilst a 
low-energy (several MeV) electron linac can be used to irradiate and sterilise food products 
and medical equipment. Very often we encounter targets onto which these particles are 
directed. Some of these targets are used to generate secondary radiation; an important 
example is to produce neutrons, where heavy-metal spallation targets taking very intense 
proton beam powers ~1 MW are commonly used. Those neutrons are enormously important 
for studies of chemistry, physics and engineering studies of novel materials. Particle physics 
experiments increasingly also call for high-intensity beams to generate such things as muons 
and neutrinos, the former for future muon colliders and the latter to see signatures of physics 
beyond the standard model. At lower particle energies around 10 MeV, electron linacs are 
used to generate bremsstrahlung photons for use in radiotherapy or for scanning cargo; 
in fact, this is the most likely situation someone will encounter a particle accelerator and 
there are around 30,000 such accelerators around the world — the majority, in fact. Another 
medical application is the use of proton cyclotrons to generate radioisotopes; fluorine-18 
is the most commonly-produced isotope, made by directing ~ 10 MeV protons onto an 
enriched water target. Higher-energy protons and other species such as carbon-12 ions are 
directed into patients to perform particle therapy, another form of radiotherapy. We will 
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FIGURE 1.4 An accelerating cavity for X-ray cargo scanning, shown cut in half to reveal the internal 
accelerating cells that oscillate in voltage together at the same frequency. Electrons enter from the left at 
a low velocity and exit on the right after their acceleration; as they gain velocity they travel further per 
oscillation period and hence the cavity cells must get longer. 


not describe these applications in detail, but will discuss the basics for the operation of the 
types of accelerator they employ. 

From an accelerator point of view the most important secondary particle phenomenon is 
that of photon production; it’s a fundamental behaviour when charges accelerate in electro- 
magnetic fields, and so we discuss it in detail in Chapter 6. We show the basic connection 
between the bremsstrahlung utilised in radiotherapy and the production of photons via 
synchrotron radiation. So-called synchrotron light sources are a widespread application of 
electron accelerators — there are nearly a hundred such facilities around the world now — 
and they make use of the enormous enhancement of photon production when electrons with 
a large kinetic energy travel through a specific magnetic field arrangement. 

Regardless of the type of accelerator, there are limitations in the accuracy with which 
it is built, and this must be considered during the design. For example, misalignments 
of magnets give rise to trajectory errors which must be corrected using additional small 
steering magnets (also called corrector magnets). The effect of the misalignments them- 
selves must be measured using suitable diagnostic instrumentation such as beam position 
monitors (electrostatic pickups, screens and so forth) and measurements of the total beam 
charge/current and the dimensions of the beam. In virtually every modern accelerator there 
is a control system in which a coordinated set of computers and instrumentation interfaces 
brings together measurements of the operation and status of each element of the accelera- 
tor, to allow adjustment of the operation of it. These are most important during the initial 
commissioning of the accelerator with particle beams. 

We see that the field of particle accelerators is very broad, and we have not mentioned 
many of the possible subsystems that may be encountered; that is the job of more specialist 
texts. Here, we attempt to give an overview of the principle ideas involved in the practical 
design and construction of an accelerator system, and discuss the most-often encountered 
components and phenomena; we therefore restrict ourselves to discussing mostly conven- 
tional accelerator components, that is the electromagnets and accelerating cavities used 
today in the vast majority of particle accelerator facilities. 

We divide our discussion into the following chapters. In Chapter 2 we discuss the basic 
ideas of charges and electromagnetic fields that underpin all the most important ideas in 


8 The Science and Technology of Particle Accelerators 


accelerator science — this is our fundamental ‘ABC’ chapter. Next, in Chapter 3, we shall dis- 
cuss the methods used to accelerate particles, mainly resonant cavities and their behaviour. 
In Chapter 4 we meet magnets, and explain the three basic technologies used in their de- 
sign and construction: electromagnets, permanent magnets, and superconducting systems. 
The following Chapter 5 introduces beam dynamics, the methodology used to describe and 
predict the behaviour of particles as they move through an accelerator, including a dis- 
cussion of non-ideal situations such as machine imperfections. The production of photons 
by charges is the subject of Chapter 6, including the very important field of synchrotron 
radiation. Finally, Chapter 7 discusses a few of the most important complexities that arise 
when particles interact with each other. 


ABC: Accelerators, Beams, and 
Charges 


2.1 The Electromagnetic Field and Its Properties .... 10 
Maxwell’s Equations * Forces on Charged Particles 

Bat TCIM ASST EA O E A ehaanpeoaged ahh 13 

2.3 Particle Motion in Electromagnetic Fields......... 14 


Curvature in a Magnetic Field * Conservation of 
Energy in Electromagnetic Fields * Energy in 
Electromagnetic Plane Waves * Radiation Pressure 


2.4 The Basics of Acceleration ...................0000000s 23 
2.5 The Particles Used in Accelerators ................. 25 
20 The Tener ABU ceciesecercpcbisotedeinddnebigenicn ar 
Bertem Gee a a eaS ak 


The realm of particle accelerators is effectively that of electromagnetism, where we have 
also to include aspects of the theory of special relativity since in most cases our particles 
are moving at some significant fraction of the speed of light, c = 2.99792 x 108 ms~!. This 
chapter will explain the key ideas that lay the foundation for the rest of the book. We 
begin with a brief review of the electromagnetism, assuming that the reader has had some 
introduction to the topic at the undergraduate level; there are a number of excellent texts 
on this subject [1, 2, 3, 4]. We then discuss the effect of externally-applied electric and 
magnetic fields upon charged particles, and discuss when relativistic effects are important. 
We then discuss the exchange of energy between the electromagnetic field and a set of 
particles, including the exchange of energy with electromagnetic radiation. We leave the 
discussion of the production of electromagnetic radiation — by the particles themselves — 
to Chapter 6. Charges may also exert significant influence upon each other due to their 
mutually-experienced electric and magnetic fields, and we will introduce that topic ready 
for a longer discussion in Chapter 7. 

It may initially be somewhat surprising that quantum ideas do not often have to be 
invoked in particle accelerator science; after all, the particles such as electrons and protons 
with which we are concerned are the basic quantised building blocks in nature. However, we 
will see that most of the more-or-less classical description of electric and magnetic fields will 
suffice. For certain phenomena — principally those in which photon emission and absorption 
is involved — we will occasionally have to resort to quantum ideas. 
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2.1 The Electromagnetic Field and Its Properties 


2.1.1 Maxwell’s Equations 


Electromagnetism, meaning the motion and dynamics of charges and currents, is governed 
in the most general way by Maxwell’s equations. In the classical description of charged 
particles, a stationary charge of magnitude q exerts an electric field that is both isotropic 
and extends to infinity instantaneously; ‘instantaneously’ means that the presence of a 
charge can be immediately experienced at some location a distance l away. Straight away 
we see that this cannot really be true, since the ideas of special relativity tell us that if a 
charge is moved, it takes some time t = l/c for a distant observer to become aware of that 
motion; this is the idea of the retarded time; we discuss this later in Chapter 6. Charges 
are either positive or negative, and one may think of them therefore as being either sources 
or sinks of field lines; the number of field lines is proportional to the charge. 

Field lines must begin and end on charges — there can be no discontinuities in the field 
lines (a point with a discontinuity implies extra charge there), see Fig 2.1. Therefore, if we 
surround a charge (or set of charges) with a surface and count up the field lines passing 
through the surface, the total number is proportional to the charge (we do this correctly by 
counting field strength perpendicular to the surface, i.e. E -dS — the density of field lines is 
proportional to E). What we have described here is Gauss’s Law, which is 


gpas=4, (2.1) 
s £0 

where S is the surface that encloses the volume V. €9 is the constant of proportionality 
between field and charge, and is known as the permittivity of free space. A point charge 
(such as a single particle) lying at the centre of two concentric spherical surfaces 1 and 2 
with radii r; and r2 must have the same number of field lines passing through both surfaces 
(Fig 2.2). In this case we have spherical symmetry* and so Gauss’s Law becomes 


Anr?E, = 4rr2 E> (2.2) 


where F; and E> are the magnitudes of the electric fields passing through each surface (per- 
pendicularly in this case). From this we see that electric field strength around a stationary 
charge obeys the well-known inverse-square law 
q 
E(r) = ——~. 2.3 

(r) Ategr? (=al 
The differential form of Gauss’s Law is a restatement of the idea that charge creates field, 
and is 


V-E= =. 
€0 


It is an observed fact that there are no sources or sinks of magnetic field, hence the 
equivalent to Gauss’s Law for magnetic fields is (Fig 2.3) 


fB -dS = 0. (2.4) 
The differential form for this equation is 


V-B=0. 


*In other words, we cannot tell which way round a point charge is facing — it has no inherent orientation. 
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V+ 


Source Sink 


FIGURE 2.1 Charges are sources or sinks of electric field lines; the number of field ‘lines’ is proportional 
to the amount of charge q. 


Surface 1 


Surface 2 


FIGURE 2.2 Two concentric surfaces surrounding a given charge must have the same number of field 
lines passing through each surface. 


No 


FIGURE 2.3 The total magnetic flux passing through a surface must sum to zero (top figure). If the 
total flux were non-zero (lower figure) it would imply that there was a source of magnetic field within the 
volume bounded by that surface; this is not possible for magnetic fields. 
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Time-varying electric fields can generate a magnetic field, as can a motion (flow) of 
charges; a flow of charges is a current. The generated magnetic field is described by Ampere’s 
Law, most easily expressed as 


OE 
V x B = poj + Hoco = 


2.5 
AE (2.5) 
Similarly, a time-varying magnetic field can generate an electric field; this is Faraday’s Law 
OB 
Vx E = -— 2.6 
x va (2.6) 


(there is no equivalent term to electric current in this equation because there are no such 
things as magnetic ‘charges’). Note the minus sign in Faraday’s Law: the induced electric 
field acts to oppose the change in magnetic field — this is Lenz’s Law. 

Another important thing we know in electromagnetism is that charge is conserved. 
Hence, if we have a volume V and consider the charges that may be moving into or out of 
it, we may relate the current j flowing through the surface S around V to the change of the 


total charge Q within it as 
dQ 
dS = —-—— 2.7 
$ j: dt (2.7) 


Using the divergence theorem we can rewrite the left-hand side of this equation as 


fias= f V -jdV, (2.8) 
S V 


and write Q = fy pdV in terms of the charge density p(r) inside V. Hence 


a y jav = È pdV = Le oP av, (2.9) 


where we can make the latter transformation because we choose that the volume V does 
not change with time. But the chosen volume V may be of any size; we can reduce it to an 
arbitrarily small volume. We can therefore remove the integrals from this equation to yield 
dp _ 
aa (2.10) 
This is the Continuity Equation; it’s just the differential statement that charge is conserved 
when charges are moving. 


2.1.2 Forces on Charged Particles 


The force F on a charge q moving with velocity v within the presence of an electric field E 
and magnetic field (or more correctly, the magnetic flux density) B is given by the Lorentz 
force law 

F = q(E +v x B). (2.11) 


We see that the electric force points in the direction of the field, and hence the electric 
field E can do work upon the charge (or vice versa); the acceleration of a charge due to an 
electric field is E 

gs (2.12) 


m 


However, magnetic fields do no work since the force exerted on the charge is at right-angles 
to the direction of B due to the cross product; also, charges must be moving in order to 
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experience a force due to a magnetic field B. It is typical in many accelerators to have 
charges moving at speeds close to the velocity of light, c. An electromagnet providing a 
field B = 1 T will produce a force |F| ~ cB equivalent to the force exerted by a electric 
field of EF = 300 MV/m. A 1 T electromagnet is routine, but a 300 MV/m electric field is 
highly challenging to produce and is only really encountered in plasma accelerators. It is for 
this technological reason that magnetic fields rather than electric fields are predominantly 
used to guide and focus the charged particles in an accelerator. However, magnetic fields do 
no work and so cannot change the speed of a charged particle. A charged particle moving 
through a uniform magnetic field B will experience a constant force at right-angles to its 
motion, and hence move in a circular path due to a transverse acceleration a, = quB/m. 
The radius of that circle, p, mapped out by the particle is dependent upon the momentum 
of the particle and the magnetic field strength, B, and is 
mu 
= on 


Obviously, both accelerations (electric and magnetic) depend upon the charge’s mass. At 
sufficient velocities that mass is no longer the rest mass, but increases due to relativistic 
effects. 


p (2.13) 


2.2 Relativity 


A charged particle may be accelerated to large velocities such that its kinetic energy becomes 
comparable to or much greater than its rest energy; the effects of relativity must then be 
taken into account, and this is the case for nearly all the situations encountered in particle 
accelerator science. The behaviour under the conditions for special relativity will however 
suffice rather than any effects due to general relativity. 

In this book, we adopt the notation conventions mo for the rest mass of a particle and Eo 
for the rest energy, and we use the convenient units of MeV for energy and MeV/c? for mass; 
1 eV is equal to about 1.602 x1071° J. Since the rest energy of a particle is just Ey = moc’, 
we may readily convert for example the rest mass of an electron me = 9.109 x 1073! kg — 
which is equal to about 0.511 MeV/c? — to a rest energy of 0.511 MeV; the numerical value 
is the same. 

The most important variable to determine for a fast-moving, high-energy particle is its 
relativistic ‘gamma’ or ‘Lorentz’ factor, which is the ratio of the total energy of the particle 
to its rest energy 

E T + Eo 
Eo Eo 
where we denote the kinetic energy of the particle as T; a stationary particle has T = 0 
and so y = 1, and y may be arbitrarily large for a large kinetic energy. The mass of a fast- 
moving particle increases to m = ym and an elapsed time to in its own frame of reference 
is dilated to At = yAto when viewed by a (stationary) observer; hence an unstable particle 
with lifetime 7 takes longer to decay if it is moving rapidly. The particle velocity is v and 
so the velocity relative to the velocity of light c is given by 6 as 

v 1 
such that 8 cannot be greater than 1. At very low velocities, this relation reduces to the 
classical relation v = \/2T/mo = cą/2T/ Eo. Particle momentum is always given as p = mv, 
but it is useful to express it as 


y (2.14) 


(2.15) 


p = nw = Bymoc = ByEo/c = = (2-1). (2.16) 
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Often we are interested in cases where y > 1, in which case the quantities above have much 
simpler expressions: 


po 1, 
Um C, 
p> moe, 


E~ pe. 


A particular example illustrates these concepts. We imagine two particles — a proton 
and an electron — each with kinetic energy T = 250 MeV. The total proton energy is just 
E ~ 1181 MeV (the proton rest energy is Eo ~ 938 MeV). The proton has a y not much 
more than 1 (y ~ 1.27) such that its velocity is 8 ~ 0.614. The momentum is then just 
p ~ 731 MeV/c; note that all the numerical values (T, E, and p) are different from each 
other. 

By contrast, an electron with T = 250 MeV is ‘ultrarelativistic’ with y ~ 490, 6 = 1 (to 
a very good accuracy), total energy E = 250.511 MeV and p = 250.511 MeV/c. Note that 
E ~ T so that often these values are interchanged, and when considering ultrarelativistic 
particles for most practical purposes, the difference between T and E doesn’t matter (it’s 
generally far smaller than other uncertainties in other accelerator parameters). 


2.3 Particle Motion in Electromagnetic Fields 


2.3.1 Curvature in a Magnetic Field 


We again consider the separate effect of electric and magnetic forces upon a charge. In 
the electric field case, as the charge gains in velocity its mass increases and restricts the 
ultimate velocity to a value less than c. In the magnetic field case, the charge is accelerated 
transversely without gaining energy, but now 

ymov 


= ap (2.21) 


We often write this expression as 
P 
(Bp) = 7 (2.22) 


and call (Bp) or p/q the beam rigidity. The beam rigidity p/q describes the resistance of a 
beam of charged particles to being bent into a radius p by a given magnetic field B; for a 
given rigidity (Bp), doubling the field will halve the bend radius. We may express the beam 
rigidity equation in several ways. Expressing the momentum in units of GeV/c (1 GeV/c is 
equivalent to 5.3 x1071° kg ms~'), the bending radius is 


p[GevV /c] 


plm] = 3.33 BIT] 


(2.23) 


In general, the bending radius is 


Eoy? -1 (2.24) 


qcB 
for a particle of rest energy Eo. For protons this is 


plm] = 3.13/77 — 1 (2.25) 


BT] 


p= 
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and for electrons 


ae 1.71 x 10-3.\/7? — 1 (2.26) 
plm] = BT] . ; 
Note that ultrarelativistic electrons (which practically means any electron with T > 
10 MeV), 8 ~ 1 and E ~ T and we may relate the energy as 


E[GeV] ~ 0.3Bp[Tm]. (2.27) 


A 1 GeV electron will move in a 1 T field with a bending radius of 3.33 m. 


2.3.2 Conservation of Energy in Electromagnetic Fields 


A volume V with electric and/or magnetic fields present within it has associated electric 
and magnetic field energies Ug and Ug of 


1 1 
U = Up + UB = 50 | EPdv + = B?av. (2.28) 
2 Jy 24o Jy 


The energy density of an electric field (per unit volume) is therefore just €9£?/2 and that 
of a magnetic field is B?/2j19. Let’s now consider the rate of change of energy in a fixed 
volume. We start by considering a single charge q moving at some velocity v through an 
electromagnetic field (i.e. composed of both E and B components). The charge q may do 
work upon the field due to the Lorentz force acting upon the charge. The work done on 
the charge is (as for other forms of work) dW = F - dl, so that the power exerted by the 
electromagnetic field upon the charge can be determined as 


_ dW 
dt 
dl 
2 P22 IRS 2.29 
dr.v, (2.29) 
=q(E+vxB)-v 
= gE-v. (2.30) 


(Note that this power P is measured in watts.) We have cancelled out the second term (in 
B) and we see the well-known fact: magnetic fields never do any work. Only the electric 
field E can do work on a charge. We may express this idea in another way: integrating the 
work done over some path l, we have for the total work 


W= [w- fæa (2.31) 
l 
The work-energy theorem allows us to deduce that an electric field E(r) (a vector field over 
some space r) may be equivalently described by a scalar potential U(r) such that 
E = -VU. (2.32) 


It can be shown that U is unique at a particular point r. A charge moving from rı to r2 
sees a change in potential (‘voltage’) U; to Uz, and the work done is 


W = q(Uı -E U2). (2.33) 


A change in voltage AU gives a change in the kinetic energy of the particle AT = qAU. 
Note that here we adopt the convention U for voltage since we are using V for the volume; 
however, in the rest of this book we use V for voltage as usual. 
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If a positive charge is moving in the same direction as the electric field, it gains energy 
and the field E does work on the charge; if the charge moves opposite to the E field direction, 
then the charge does work on the field. The idea that an electric field E does work on a charge 
(or vice versa) implies that energy is exchanged from one to the other. For a distribution of 
charges moving in an electromagnetic field, we may calculate the power exerted upon (or 
by) those charges (which have charge density p) within an infinitesimal volume dV. This is 
just 


P= pdVE-v (2.34) 
=j-E. (2.35) 


This is easy to see since the total charge in the volume dV is just dq = pdV, and since the 
current density (current is just moving charge) is j = pvdV (if the volume is infinitesimal 
we may expect that the charges are all moving in the same direction and with the same 
speed). From this idea, we may now develop an energy conservation law. We start with two 
equations, Faraday’s Law: 


OB 
B.S] -_RBx E 2 
om Vx (2.36) 
and Ampere’s Law: 
OE . 


Adding these two equations together gives 


OB OE 
B- — + poe0E- = Loj: E (B-VxE-E.V xB), 


ot Ot 
10 10 
-I B.B ZL E.E = -mj E- V. (Ex B). 2, 
2 pi + Hoeo5 ag Hoj V: (Œ x B) (2.38) 
Let’s define a quantity that will be useful (now and later on) 
1 
S=—ExB. (2.39) 
Ho 


S is called the Poynting vector.* With this definition of the Poynting vector we can re- 
express our equation above as 
10 ( 
2 Ot 
This equation applies to an infinitesimal volume. Let’s integrate over some volume V (also 
dividing through by puo for convenience) which yields 


d ite, 1 :) fi | 
aE 4 BY ) dV = — -EdV — | V-SdV. 2.41 
dt Jy È ° 2o P v en 


This is beginning to look a bit like an equation about energy, but we’re not quite there yet. 
Our final step is to apply the divergence term to that last term in S (and also re-label those 
terms on the left-hand side) which gives 


d i 
qUe+Ue)=— | j Bav- $s-aa. (2.42) 


B? + uoco E’) = -uoj : E — V - (uoS). (2.40) 


dt 


* This quantity is named after its inventor Henry Poynting, so don’t mis-spell it as ‘pointing vector’ even 
though it’s quite tempting to. 
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This is definitely an equation about energy conservation. The left-hand side is the rate of 
change of energy in the field, which is balanced by the two terms on the right-hand side. 
The first is the rate of work done on any charges moving in the volume (as we saw above), 
whilst the second is what power is flowing out of the surface surrounding the volume V. 
Note that this implies that there is power present in an electromagnetic field, and that 
the flux of energy (and its direction) is given by the quantity S. The Poynting vector is 
pointing in the direction of energy flow. We may express our energy conservation equation 


in differential form as au 
pp VSRR (2.43) 


This is known as Poynting’s Theorem. 


2.3.3 Energy in Electromagnetic Plane Waves 


We very often have to deal in accelerator physics with the behaviour of plane waves, and so 
it is instructive to consider the energy embodied in them. Faraday’s Law and Ampere’s law 
may be manipulated together to obtain two wave equations, that describe how each vary 
with position and time in free space (i.e. away from where the currents and charges may 
have generated them) as 


OE 
2 — 
V E n Hoe -oz = 0, (2.44) 
3?B 
V°B- Moco = 0, (2.45) 


solutions to which have the form of waves travelling at speed c = 1/,/fipéo. In a dielectric 
(non-conducting) material we have a modified permittivity co + e€€9 and permeability po > 
uuo where € and p are the relative permittivity and permeability characteristic of the 
particular dielectric; the material modifies the electric and magnetic fields to 


D = cegE (2.46) 
1 
H = —-B (2.47) 
HHO 
such that the wave equations become 
OPE 
2 = 
VE — HHoceo zz = 0, (2.48) 
3?B 
2 = 
V B = BHOEEO" Ag = 0. (2.49) 


In other words, in a dielectric an electromagnetic wave propagates at a lower speed 


1 C c 
= Apoo yE n’ 


where the refractive index n of the dielectric is given by n = yue. Very often u ~ 1 and can 
be omitted from equations, and the permittivity can be frequency-dependent — i.e. € = e( f) 
— which leads to the important property of wave dispersion; dispersion is where waves of 
different frequencies propagate at different velocities. In most accelerator applications we 
are dealing with electromagnetic waves propagating in a very good vacuum (much less than 
1 mbar), so that € = u = 1 to a very good accuracy; however, in waveguides (see Chapter 3) 
the effective wave velocity can be much lower (see below), and in dielectric and plasma 


v (2.50) 
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accelerators the driving laser will propagate with a modified velocity due to the material 
being traversed. 
A plane wave is a simple solution to the wave equations that has the form 


Es = E,(z,¢), 
Ey = Ey(z,t); (2.51) 


in other words, there is no variation of the field in the x and y directions. Solutions of the 
form 
E,=f(z-ct) or E,=g(z+ct) (2.52) 


are allowed where f and g are arbitrary functions (similar equations may be written for 
E,). The first solution describes a disturbance moving in the +z direction whilst the second 
describes a disturbance moving in the —z direction (see illustration in Fig 2.4). We may 
build up any function f(z — ct) or g(z — ct) in terms of different-frequency components 


E= Eye!) (2.53) 
where 
Ezo 
Eo = l 2.54 
: ( Eyo ) l ) 


Again, components of the form (wt — kz) describe disturbances moving in the +z direction, 
whilst components of the form (wt + kz) describe disturbances moving in the —z direction. 
We define the dispersion relation for a particular frequency component w as 


w 1 


= —— => —. 2.55 
"T k yame (2.55) 


k is the wavenumber (or ‘wavevector’) with an associated wavelength \ = 27/k. 


A? 


FIGURE 2.4 Illustration of an arbitrary electric field Ey = Ey (z i t), which will satisfy the wave 
equation if of the form Ey = f(z — ct) or Ey = g(z + ct) for any functions f and g; f and g in turn 
are described in terms of their frequency components where each frequency W may propagate at a different 
velocity according to the particular dispersion relation for that material, leading in general to dispersion 
(separation over distance) of the different frequency components. 


Similar to the electric field, we may define plane wave solutions for the magnetic field as 


B = Boett?) (2.56) 
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However, we already know that varying magnetic and electric fields are coupled together 


through Faraday’s Law as 
OB 
VxE=-—. 2.57 
x J (2.57) 
Substituting in separately our two solutions for the electric and magnetic fields into Fara- 
day’s Law, we obtain 


iw( Brog ae Bof) tra) = a g ra 
Eso Ep 0 


= 2(-ikEy) +9 


oN 


—ikEz0), (2.58) 


where $ and ¥ are unit vectors transverse to the direction of motion of the electromagnetic 
wave. Matching terms in X and ¥, we find 


k 

Brzo = — y0» (2.59) 
Ww 
k 

Byo = — Dro. (2.60) 
Ww 


The electric field in a given direction is coupled to the magnetic field at 90° to it. We may 
combine these two equations into one as 


k 
Bo = —Î x Eo, (2.61) 
W 


where Z is a unit vector along z. We see straightforwardly that 


k k 
E-B=-— (E2) Exo + (Er) Eyo = 0. (2.62) 
w w 
Hence E is perpendicular to B and 
k E 
B=- p; (2.63) 
w c 


We see therefore that the two field components in an electromagnetic wave are coupled 
together as they travel. But what are their typical relative magnitudes? As an example, we 
consider a radio antenna emitting electromagnetic radiation which at some distance has a 
peak electric field strength of Eo ~ 3 x 1073 Vm~! = 3 mVm™!. This electric field — which 
whilst small is still measurable — is much, much bigger than the corresponding magnetic 
field in the same region, where Bọ ~ 1071! T. A contrasting situation is that of a high-power 
laser that may drive a wakefield particle accelerator. It turns out that the typical electric 
field strength at the laser focus (which then drives the particle acceleration*) has values 
that may readily exceed Ey ~ 109 Vm~! = 1 GVm7". In this situation the corresponding 
magnetic field is quite large — with By ~ 3 T. As we will see in Chapter 4 such fields are 
quite challenging to generate using electromagnets, but arise naturally at the focus of a very 
strong laser pulse. 

We have carried out our derivation linking the electric and magnetic fields by assuming 
that both are travelling along the z direction. However, it should be obvious that we may 


*A nice example of how the energy in an electromagnetic field can do work on charges and thereby pass 
energy to them. 
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equivalently have a plane-polarised wave in an arbitrary direction given by the unit vector 
k. Now, the electric and magnetic fields may be written as 


E(r, t) = Ep tten), (2.64) 
la 1 

B(r,t)= -k x E = -k x E. (2.65) 
c Ww 


We now consider a plane electromagnetic wave (in a vacuum) travelling in the @ direction 
and assume a linearly-polarised electric field that lies in the x plane, so that 


E = E£oxX cos(wt — kz), (2.66) 
B = Bof cos(wt — kz), (2.67) 


and where By = Eo/c as shown above; we note that E and B oscillate in phase with each 
other, which we haven’t pointed out before now but which is generally true in a vacuum. 
We see from the definition of E and B that the (volumetric) energy density at a given value 
of z is just given by 


1 
Up = 560% cos?(wt — kz), (2.68) 
1 
Ug = — B? cos? (wt — kz). (2.69) 
2140 


It is left as an exercise for the reader to confirm that Ug = Ug. In other words, in a plane 
electromagnetic wave there is equal energy contained in the E and B fields, despite the 
very large disparity in the magnitudes of the actual field strengths. Given that the two 
energy densities are the same, we may combine them to obtain the total energy in the 
electromagnetic wave, 

U = Ug + Up = eo FE} cos? (wt — kz). (2.70) 


U varies both as a function of time t (at a given z) and as a function of position z (at a 
given t); this is illustrated in Fig 2.5. We may readily calculate the time average of U as 


1 
(U) = eo Eb (cos? (wt — kz)) = 5608 = eg. (2.71) 


(U) = e9E?,,,, is obtained because Ems; = Eo/V2. Note the various factors of 2 that appear 
and disappear in these expressions, so care must be taken. 


U A at fixed t U . at fixed z 


, A 


FIGURE 2.5 Illustration showing how the energy density U varies either with position (at a given time) 


or with time (at a given location); the energy density at a fixed location is not constant, but varies with 
time. 


The average energy density of the electromagnetic wave (U) has units of Jm~%. But we 
also know that, since it’s a wave, it is moving at velocity c. Hence the energy flux (rate 
of energy motion) has units Jn~?xms~! =Jm~?s~!. Above, we showed that the Poynting 
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vector S = (E x B)/,u9 was the energy flux of an electromagnetic wave (as we see, pointing 
in a direction perpendicular to both E and B. In the case here of a plane electromagnetic 
wave we have that g i 
S = — EB? = — Fo Bo cos’? (wt — kz)ĉ. (2.72) 
Ho Ho 
z is the direction of energy flow, which is the same as the direction of wave propagation. 
Averaging over time, we see that 
1 1 
(S) = — Eo Bos (2.73) 
Ho 2 
where the factor 1/2 comes from the time average of the cos? term. We may then substitute 
and re-arrange to obtain 


ep Hg = c (U) (2.74) 


since c = 1/,/fipéo and (U) = eo E/2. This is a nice result, since it says that an electromag- 
netic wave with energy density (U) transfers that energy to another location at velocity c, 
which is what we would expect. The energy flux is 


(S) = c (U). (2.75) 


2.3.4 Radiation Pressure 


We have just derived an expression that relates the energy density of an electromagnetic 
wave to its energy flux S (rate of energy flow from one place to another); we did this for a 
plane electromagnetic wave but it also applies in other situations. Electromagnetic waves of 
various sorts transfer energy at a speed c, and include such practical devices as TV and radio 
transmitters, mobile phones (which are of course just miniature transceivers*), microwave 
ovens (that transmit energy from an electromagnetic wave generator into a target — your 
dinner), and more esoteric devices such as ray-guns. 

We realise that electromagnetic waves carry not only energy but also momentum. Here, 
we think of the electromagnetic wave as being composed equivalently as a fluence of pho- 
tons.* Of course, we know that for any particle its energy E is 


FP? = pP + mic, (2.76) 


and that for photons mo = 0 and hence E = pc, so that for a given energy E we have 
p = E/c. With this idea, we can consider a volume of space that contains an electromagnetic 
wave (that has an energy density) and from that define a momentum density which we will 
label P, to distinguish it from the other variables also labelled with a ‘p’ Momentum has 
units kgms™t, so that the momentum density P, must have units kg m s71/m? =kg m~? 
at 

Since p = E/c for an individual photon, we can readily write down that the magnitude 
of the momentum density is 


|Pa| = m = >i (2.77) 


*A transceiver is a device that both transmits and receives. 

* The fluence of something is the number passing through a given area per unit time, as opposed to the 
flux which is the total quantity of something such as energy that passes through a given area per unit 
time. 
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We can now define a radiation pressure, which must be related to the momentum trans- 
ferred by the electromagnetic wave when incident upon some area. Pressure = Force/Area 
(as we know very well); to determine that pressure, we first calculate the total momentum 
transferred in some time At through some surface A (see Fig 2.6). The volume of electro- 
magnetic field that passes through A is just V = AcAt, so that the total momentum pr 
(the impulse) transferred through A is 


pr = P,AcAt (2.78) 


(momentum density x volume). But the impulse pr is just pr = F'At, where F is the total 
force acting over the surface A. In other words 


F = P,Ae. (2.79) 


The radiation pressure P, may then be simply obtained as 
F 
P= == Pes U} (2.80) 
A 
The radiation pressure is equal to the energy density — an important result! 
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FIGURE 2.6 Illustration showing how a set of photons of momentum p = E / c may impart a radiation 
pressure on a surface A. The volume traced out by the photons in a time At is V = AcAt. 


We can summarise the relationship between radiation pressure P, and the electromag- 
netic field quantities as 


P, = (0) = [El = ZE x B). (2.81) 


Examples of Radiation Pressure 


Our first example of radiation pressure is that of an electromagnetic plane wave incident 
upon a perfectly reflecting mirror. We recall that since the photons bounce off the mirror 
and then travel backwards, the momentum transferred is twice what it would be if they 
were just absorbed. Hence the radiation pressure is 


Rage, (2.82) 


This is an important phenomenon in particle accelerators. For example, an accelerating 
RF cavity will experience a force on its walls due to the electromagnetic waves that are 
confined within it deforming its shape and changing the resonant frequency, known as 
Lorentz force detuning; the conducting walls act as a mirror to the photons within the 
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cavity. One counterintuitive — yet true — consequence of this is the radiation pressure within 
an ordinary microwave oven. With the door closed, there is equal pressure on all the interior 
surfaces of the oven (including the reflecting door) due to the ~2.45 GHz photons trapped 
inside. If the door interlock is over-ridden so that the microwave power is still fed in whilst 
the door is open, then the photons coming out through the door will no longer be reflected 
and there will be a net ‘thrust’ on the microwave oven; an open microwave oven in space 
will move if power is fed into it. The corollary of this is that so-called reactionless rockets 
(where the idea is of an enclosed cavity providing thrust) cannot possibly work — they would 
violate conservation of momentum. 

Another example is that of an intense laser pulse. We consider a typical COg2 laser pulse 
with wavelength À ~ 10 um= 107° m, pulse length 7r ~ 10 ns, beam radius ~1 cm, and 
pulse energy of 100 J. The energy density in such a pulse (U) is 


100 


(U) = -ag > 10° Jm™?. (2.83) 
Therefore 
(S) =c(U) ~3.2x 10% Wm”, (2.84) 
and the radiation pressure is 
P, = (3) ~ 10° Nm”, (2.85) 


This pressure is acting over a 1 cm diameter spot focus, which means the total force is 
about 32 N. In other words, for 10 ns the laser pushes on that spot focus with the weight 
of a 3 kg object. This feature of laser pulses is important in laser-driven acceleration, since 
the intense radiation pressure from the photons falling onto a (thin) target can be sufficient 
to push the target away from its original position. 


2.4 The Basics of Acceleration 


The purpose of a particle accelerator is to deliver particles with a chosen amount of kinetic 
energy; those particles are usually in the form of a beam, i.e. a ‘stream’ of particles extended 
over time. We saw that charged particles may have their kinetic energy increased by means 
of an electric field. The simplest situation is that of a potential difference through which a 
charge travels; for example, a negatively-charged electron will accelerate towards a positive 
potential (see Fig 2.7). The electron volt (eV) is defined as that energy gained by a unit 
charge e = 1.602 x 10~!° coulomb crossing a potential difference of one volt; 1 eV is equal to 
1.602 x 10719 J. The electron-volt is the standard unit of measure in particle accelerators, 
although we typically work with MeV (million eV) or GeV (billion eV) energies. The kinetic 
energy gained is AE = qV. 

Particles can be accelerated with any suitable electric field. In the earliest accelerators a 
static DC potential difference was used (see Chapter 3), but today the predominant method 
is to utilise time-varying, oscillatory voltages created in resonant cavities; the requirement 
for particles to pass at the right time to be in phase with this oscillatory voltage is why 
most accelerators deliver bunched beams of particles. These cavities typically have resonant 
frequencies in the radio frequency (RF) part of the electromagnetic spectrum and are there- 
fore known as RF cavities; these are described in detail in the next chapter. RF cavities 
obtain peak electric fields that are limited to ~200 MV/m, resulting in an average accel- 
erating field of ~100 MV/m, and for higher accelerating fields there has been significant 
interest in inducing charge separation in plasmas — for example using an intense laser pulse 
to separate the plasma electrons from the ions — and thereby create a transient electric field 
exceeding 1 GV/m in some cases. We outline this method too. Very often, however, such 
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large gradients are not required since we may recirculate the particles within a circular ac- 
celerator to use a smaller voltage multiple times; for example, in the cyclotron, protons are 
typically accelerated across a ~100 kV Dee gap and make around 1000 revolutions before 
being extracted with a kinetic energy of around 100 MeV or more (see Fig 2.8). 

The cyclotron was the first circular accelerator — developed originally in the 1930s by 
Ernest Lawrence and M. Stanley Livingston — and relied on Ernest Lawrence’s great insight 
about the bending radius of a classically-moving charged particle. We saw above that the 
bending radius p (say, of a proton) in a magnetic field is 


MU 


= (2.86) 


p 


The time taken for one orbit in the cyclotron is tp = 27r/v, so that the proton gyrates in 
the field B at the cyclotron frequency 


| ee eal (2.87) 


For protons moving transversely to a magnetic field of 1 T, we have fe ~ 15.3 MHz. We see 
immediately that the cyclotron frequency is independent of the velocity — as long as the mass 
of the particle doesn’t change; this is highly advantageous as it allows a constant-frequency 
signal generator to be used to feed the voltage at the cyclotron Dees (Fig 2.8). This, in turn, 
allows the use of modest accelerating voltages, today typically tens of kilovolts. Another 
important observation is that the size (i.e. diameter) of a cyclotron scales x 1/B; larger 
magnetic fields give a smaller accelerator. Reducing the size of an accelerator is a common 
aspiration; for a linear accelerator this means maximising electric field gradient or for a 
circular accelerator a large B is beneficial. 


FIGURE 2.7 Illustration of how a charge changes energy due to a voltage difference from 0 to +V. 
Here, an electron (with negative charge q) is accelerated upwards by the force F = qE; crossing from one 
voltage to the other gives an energy gain AE = qV. 


As a particle is accelerated, its mass increases as m = ymo, and the cyclotron will no 
longer work isochronously (i.e. with a constant-frequency RF acceleration); indeed, electrons 
with kinetic energies of even a few hundred keV are moving close to c, and so effectively 
there is no such thing as an electron cyclotron (although there are such things as ECR 
— electron cyclotron resonance — ion sources). Above about y = 1.3 we must change the 
accelerating (RF) frequency to maintain synchronism with the accelerating bunches; in 
a synchrocyclotron the Dee frequency matches the revolution frequency, but only of one 
accelerated bunch at a time — the maximum bunch extraction rate is therefore the rate 
at which the Dee frequency can be ramped up and down, typically about 1 kHz. In 1945 
Vladimir Veksler and Ed McMillan independently realised the principle of phase stability [5, 
6], and this was demonstrated in 1946 on the first synchrocyclotron — adapted from the 
earlier 37-inch cyclotron at Berkeley. 
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FIGURE 2.8 Illustration of the (classical) cyclotron, where the upper (N) pole has been raised to show 
the internal layout. A uniform vertical magnetic field B created by the N and S poles confines protons 
(orbiting horizontally within the vacuum vessel) into a circular path of radius p = mw /qB. At each Dee 
gap crossing, the protons gain an energy qV for a Dee voltage V; the voltage polarity must therefore be 
swapped at each side of the Dee crossing, meaning that the Dee frequency is the same as the cyclotron 
frequency fe (or it can be some integer multiple h of it). As the protons accelerate they gain energy 
and increase in radius p, but retain the same fe as long as their mass does not increase significantly. Many 
bunches at different energies and radii can co-exist simultaneously in such a cyclotron, each bunch eventually 
being extracted at the outer radius of the magnet. 


The synchrotron improves upon the synchrocyclotron by also varying the magnetic field 
B = B(t) with time; here, the path of the particles through the magnets is kept constant 
as the particle energy increases and the RF is matched to be frr = hfr where f, is the 
(orbital) revolution frequency and the harmonic number h is an integer. An illustration is 
given in Fig 2.9. The betatron — invented by Donald Kerst also in the 1930s — is similar in 
that it circulates charged particles (here electrons) at a constant radius, but uses induction 
acceleration via an e.m.f. generated as the magnetic field itself varies. Frank Goward and D. 
E. Barnes adapted a betatron to build the first synchrotron in 1946 at Woolwich (London) 
which accelerated 8 MeV electrons, and the following year an electron synchrotron at General 
Electric’s laboratory demonstrated the production of synchrotron radiation (see Chapter 6). 
By maintaining a constant beam path that is independent of particle energy, the magnet 
sizes can be enormously reduced particularly at high energies enabling the very largest 
colliders such as the LHC to be produced with a realistic cost. The other great advance 
made around the same time (in 1949) was Nicholas Christofilos’s strong-focusing principle, 
which allows the circulating beam size to be greatly reduced, making the magnets much 
smaller again; this is discussed later in Chapter 5. 


2.5 The Particles Used in Accelerators 


Since this book is all about particle accelerators we also need to consider which particles to 
use. Any charged particle can be accelerated using an electric field; in the broadest sense 
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Extraction 


FIGURE 2.9 Illustration of a synchrotron, which uses a number of dipole magnets whose field strength 
B varies with time; the momentum of the particle p = qpB follows the magnetic field strength. Strong 
focusing — either within the dipoles or here using additional quadrupoles — provides a small beam envelope 
and thereby a small magnet aperture. Injection and extraction is typically done with pulsed magnetic 
elements but may also be performed with so-called stripper (charge exchange) foils. 


they behave similarly. The differences lie in: the convenience with which they may be ob- 
tained; their mass and charge, which determines the acceleration for a given electromagnetic 
field; and whether they are stable. The most common particle to accelerate therefore is the 
electron, since it is relatively easy to liberate electrons from a surface by simply heating 
it and applying a voltage to it; they are the lightest charged particle, and so are also the 
easiest to make relativistic. We should mention here the positron, the antimatter pair to 
the electron. From the accelerator’s point of view they behave exactly the same, except 
that since they have the opposite charge (+e) they require the opposite polarity for all the 
fields; this is mostly readily achieved by ‘swapping the connections’ on all the power sup- 
plies. To make protons we can either directly use a radioactive source of 8+ particles (such 
as sodium-22) or create them in larger numbers using pair production in a suitable target 
but these methods are not trivial; since positrons for the most part give similar phenomena 
in accelerators, we rarely use them and instead prefer electrons. The most common use 
of positrons in an accelerator science is in electron-positron colliders where both particles 
are accelerated and then made to collide into each other for fundamental particle physics 
studies. 

The second most common particle to accelerate is the proton. As we saw earlier in this 
chapter, they are much more massive than an electron (m,/me ~ 938 MeV/0.511 MeV = 
1836) and so making them relativistic is harder; we must also account for the varying velocity 
as described earlier. Protons can be generated in an ion source by ionising hydrogen gas 
with a suitable large voltage discharge; ion sources are briefly described in Chapter 3. H7 
ions are also often used, as they can allow more intense beams to be more efficiently injected 
or extracted in an accelerator system; a thin stripper foil (perhaps of graphite, aluminium 
oxide or other robust material) can be placed into the H~ beam causing the electrons to be 
lost but transmitting most of the remaining protons. 

Other particles which may be accelerated include atomic ions, for example, carbon ions 
for particle radiotherapy or heavy ions such as gold, lead, or uranium for nuclear physics 
applications. Since the atoms of all the elements, apart from hydrogen, have more than one 
electron it is possible for ions to have multiple charges. In other words, since lithium has 
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three electrons it is possible to create lithium ions in three different positive charge states 
(Lit, Li?™, and Li?+) depending on how many electrons are removed from the atom. Within 
the same electric field, these three ion states will gain kinetic energy proportional to their 
charge state. This ability to impart greater kinetic energy to higher charge states is taken 
advantage of by choosing to work with extreme cases such as 7°Ur??+. It is usual for the 
kinetic energy gained in an accelerator by an ion to be quoted per nucleon in units of MeV/u 
(a nucleon is a proton or a neutron, so there are 238 nucleons in this ion). We generally 
ignore the small difference between the atomic mass unit, u, and the actual nucleon mass 
for the ion. 

Finally, we give an example of exotic particle acceleration: the muon. An elementary 
particle similar to the electron, with the same charge but about 207 times the mass. At the 
same kinetic energy, muons radiate far less synchrotron radiation (a factor 2074 less — see 
Chapter 6) making a muon-muon collider an attractive prospect. However, at rest, muons 
have a lifetime of only around 2.2 us before they decay, and so must be accelerated rapidly 
to large y to extend their lifetime via time dilation. No one has yet decided to build such a 
collider. A possible first step, under consideration, would be to build a muon storage ring to 
generate intense beams of neutrinos, also for fundamental particle physics measurements. 


2.6 The End of ABC 


This concludes our introduction to the field of accelerators, our ABC. In the following 
chapters we discuss the principles of the common elements used in nearly all accelerators 
— the RF acceleration, the magnet systems, the beam dynamics needed to understand and 
specify these systems, the radiation the particles may produce and what happens when we 
have many particles in our bunch. We shall start with the heart of any accelerator, the 
accelerating structure! 
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The vast majority of particle accelerators are designed to increase the kinetic energy of 
charged particles, usually in the form of a particle beam. This is performed by placing those 
charged particles within a suitable electric field. In this chapter we look at several ways 
of applying electric fields to charged particles to provide efficient and stable acceleration. 
First we will examine electrostatic accelerators and their limitations, before moving on to 
radio-frequency (RF) accelerators. 
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3.1 Electrostatic Accelerators 


The simplest type of particle accelerator can be constructed from two metal plates, an anode 
and a cathode, separated by a vacuum section and held at different electric potentials by 
some external voltage source. This simple idea is the foundation of many low-energy or 
early particle accelerators. There are three possible configurations for creating the very 
high voltages required for MeV scale accelerators: 


1. Van de Graaff Generators (invented by Robert Van de Graaff in 1929), which transfer 
charge to a high-voltage terminal via a belt; 


2. DC power converters which convert a low voltage and high current, to a high voltage 
at low current; 


3. pulsed modulators, which store energy in capacitors or inductors and release it quickly 
at a higher voltage. 


3.1.1 DC Power Converters 


DC power converters operating up to 600 kV are readily available at GW of power in the 
electricity industry and have undergone significant development in recent years for high- 
voltage DC transmission. Typically, the input to the system is a 3-phase AC input* which 
is the common method of supplying power from a national electricity grid at large currents 
and voltages to high-power machinery. The AC signal will be rectified to DC using a full- 
wave diode rectifier. However, simply using a rectifier would have too much power ripple so 
a low-pass filter is also necessary to remove the AC frequency and higher harmonics from 
the output. The size of capacitors and inductors required for the smoothing is inversely 
proportional to the AC frequency, hence a higher AC frequency is often used. In order to 
do this, an AC-AC bridge converter is used. In this device switchable diodes or switches 
such as thyristors are used to first rectify the input AC frequency to a DC signal and then 
fast switches are used to chop to AC at a higher frequency. In order to create a higher DC 
voltage than the input voltage, a boost converter is used, as shown in Fig 3.1. Here the load 
resistance is placed in series with a large inductance. A switch is placed in parallel with 
the resistance such that, when closed, the current will bypass the load and the inductor 
will draw a high current. When the switch is opened the load will draw current from both 
the input supply and from the discharging inductor creating a higher voltage. A capacitor 
can also be used in parallel with the load to smooth the voltage out so that the load sees 
a roughly constant voltage. The ratio of the output voltage, Vout, to input voltage, Vin, is 
equal to the time the converter is in the off state (when the switch is open), Tog, divided 
by the switching period, T [1], 
T 
Vout = Ving (3.1) 
A DC-DC converter is limited by the maximum voltage that the switch can handle. 
If particle energies higher than 600 keV are required, then a Cockcroft-Walton voltage 
multiplier can be utilised [2]. This uses an AC supply or pulsed DC to charge an arrangement 
of capacitors and diodes, as shown in Fig 3.2. During the first half cycle the first capacitor 
charges when a negative voltage is applied over it. The diodes ensure that the second 
capacitor is isolated during this step. In the 2nd half cycle, the polarity is reversed and 
the 2nd capacitor is charged by the AC supply and the discharging 1st capacitor, thereby 


*For example, in the UK the 3-phase supply is 415 V at 50 Hz. 


Acceleration 31 


DC DC 


On state Off state 


FIGURE 3.1 Circuit diagram of a boost converter showing the ‘on’ state and the ‘off’ state. 
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FIGURE 3.2 Circuit diagram of a Cockcroft-Walton voltage multiplier. 


providing twice the charging voltage. As the second capacitor only has a positive voltage 
applied across it, it will have a DC output if the time constant of the capacitor discharging is 
longer than the switching period. Multiple stages can be provided so that the multiplication 
increases with each stage. This concept was first utilised in 1932 by John Cockcroft and 
Ernest Walton in the first nuclear disintegration experiments using 1 MeV beams. They are 
still in use today in many proton and ion accelerators. The output voltage for a Cockcroft- 
Walton with n capacitors, with a supply peak-to-peak voltage of Vpp is given by [3] 
3 2 T 


o m n? n? Nene i 
Vout = (Z) Vor E 7 — FD? — 3D + 1) | Z Tout, (3.2) 


where T is the switching period, D is the duty cycle (D = Ton /T), C is the capacitance 
and Tout is the current drawn by the accelerator. 


3.1.2 Pulsed Modulators 


Another method of generating high voltages is to store energy in a capacitor bank over a 
long period of time and discharge it over a shorter timescale providing a high voltage and 
current simultaneously for a short period. Several MW or even GW of power in nanosecond 
to microsecond pulses can be created using this method. The most common topology for 
this is the Marx bank generator, invented by Edwin Otto Marx in 1924 [4]. In a Marx bank, 
a number of capacitors are charged in parallel from a DC supply using the circuit shown 
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FIGURE 3.3 Circuit diagram of a Marx bank generator, with N stages. 


in Fig 3.3. A number of switches are also employed such that, when closed, the capacitors 
become connected in series rather than parallel so that they can supply a voltage equal to 
the supply voltage multiplied by the number of capacitors. In many configurations, spark 
gaps are used instead of switches such that when the capacitors reach a given voltage they 
automatically conduct providing the series connection. Another type of pulsed modulator 
is the line type modulator, where the energy is stored in a transmission line, made of a 
network of capacitors and inductors or coaxial line, such that a square pulse is produced of 
duration equal to twice the line length divided by the velocity of the pulse on the line. 


3.2 Particle Emission 


At the start of all accelerators is a source of charged particles, either electrons, protons, ions 
or negative ions. These sources can provide a continuous or pulsed emission as required. 


3.2.1 Electron Emission 


In order to accelerate a beam of charged particles we must first obtain charged particles 
in vacuum. We cannot create charged particles from nothing so they must be either moved 
from somewhere or created in a nuclear or ionising reaction from neutral particles or in 
pair production. Electrons most commonly are emitted from a metal or semiconducting 
cathode. Conducting metals or doped semiconductors contain a number of free electrons 
in the conduction band; however, these are not able to escape the material due to a finite 
work function, which is the energy required above the Fermi level (which is the energy of 
the highest conduction band) to remove an electron from the material to the surrounding 
vacuum. To create free electrons in a vacuum we must provide enough energy to the electrons 
to allow them to overcome the work function. This can be achieved by either heating the 
emitter or with photons via the photoelectric effect. Emission via heating the emitter is 
known as thermionic emission and was first observed by Edmond Becquerel in 1853, and 
the British physicist Owen Willans Richardson received the Nobel Prize in 1928 for his 
pioneering work on the subject and the development of the Richardson law, which gives the 
current density, J, from an emitter as a function of the cathode temperature, T, and work 
function, dy, as 

J = AT? etw lk (3.3) 


where T is the cathode temperature, dw is the work function of the material (for example, 
copper has dw ~ 4.7 eV), and k = 1.38 x 10-78 JKT! is Boltzmann’s constant. A is a 
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material-specific constant that has typical values ~ 3 — 17 x 10°Am~?K~?, and is equal to 
~ 6 x 10°Am~?K~? for tungsten. 

Unfortunately the Richardson equation is only true for low beam currents or high tem- 
peratures due to the space charge of the emitted electron beam. Emitted electrons repel 
electrons near the surface reducing the current that can be emitted. This effect can be 
overcome, however, by putting the emitter at a negative potential with respect to an anode, 
thereby creating an electric field which accelerates the electrons from the emitter (cathode) 
to the anode. By applying the electric field of a continuous electron beam to Maxwell’s 
equations, Child and Langmuir derived an equation for the maximum current density, J 
(in A/m?), which can be emitted from a cathode as a function of the applied potential 
difference, V, and gap, d, between the anode and the cathode [5]. For two parallel plates, 
the Child-Langmuir law is 


J= 


4 2 1/2 y3/2 y3/2 
o( £ ) . (3.4) 


= 2.33 x 107° 
9 Me d? d? 
It is common to provide a constant of proportionality between the current and voltage, 
known as the perveance, P, with units of Perv such that 


I= Py?/?, (3.5) 
where for parallel plates, the perveance is given by 


p- 2.33 x 1076A. 


2 (3.6) 


and A, is the emission area. 

Hence, we arrive at two regimes of electron emission, each limited by one of the two 
equations above: temperature-limited emission (for high voltages where the space-charge 
does not limit the emission current density), and space-charge limited emission (for high 
temperatures where the temperature doesn’t limit emission). In space-charge limited emis- 
sion we can turn the emission of electrons on and off by modulating the applied voltage. 
Typically for thermionic emission the current density is limited to a few A/cm? (typically 
around 10 A/cm?) to ensure the cathode doesn’t degrade too quickly due to high temper- 
ature operation. 

Electrons can also be emitted using the photoelectric effect known as photo-emission, 
where a photon is absorbed and an electron is emitted as a consequence. The process is 
quantified by the quantum efficiency 7 of the photocathode which is the average number of 
electrons emitted for each incident photon; normally 7 < 1. The quantum efficiency depends 
on the photocathode material, laser wavelength, accelerating field at the photocathode and 
the vacuum environment. Photocathodes can be metals or semiconductors. Metal photo- 
cathodes have long lifetimes and are very simple but have very low quantum efficiency; for 
example, copper or molybdenum photocathodes both have 7 ~ 0.001 %. Semiconductor 
photocathodes such as GaAs or Cs2Te can have orders of magnitude higher quantum effi- 
ciency 7 ~ 10 %, but their lifetimes are lower such that their quantum efficiency can drop 
to a few % in a matter of days [6]. 

It is also possible to emit electrons from a cathode via quantum tunnelling through the 
potential barrier created by the work function; this is known as Fowler-Nordheim tunnelling, 
or more commonly in the accelerator community as field emission. The potential barrier is 
normally very wide; however, if we apply a potential difference between an anode and a 
cathode the potential must go linearly from the work function at the cathode to the work 
function minus the potential difference at the anode. When the potential across the vacuum 
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gap drops below the Fermi level it is possible for electrons to tunnel across the distance 
to this point from the cathode. The higher the potential difference, or the smaller the gap 
between the anode and cathode, the smaller the distance the electrons need to tunnel and 
the higher the probability of an electron tunnelling. Electron emission via this mechanism 
is known as field emission. If the electric field at the cathode is ~ 100 MV/m or higher, 
and locally the field can be much larger on the nanometer scale due to surface roughness, 
then very large currents can be produced by this phenomena. The current density, Jpn, in 
A/m? which is produced by a field Epiat in V/m, is given by Fowler-Nordheim theory for a 
triangular barrier as [7] 


= (BE fiat)” Brynów. 
Jrn(E) = Arn dw exp ( BrE fiat 4 (3.7) 


where Apy ~ 1.54 x 1078A eV/V? and Bry ~ 6.83 x 10°eV3/ V/m; bp is a field enhance- 
ment factor, and w is the work function in eV. As the emission is dependent on the electric 
field at the cathode, this emission can be highly dependent on the geometry. Geometries 
which provide higher electric fields at the cathode for a given potential difference produce 
more current. One geometry that provides a very high local electric field at the cathode 
is a whisker or rod which is smaller than the anode cathode gap but with a large ratio of 
length to radius. Such a geometry can provide a local electric field at the cathode surface, 
Focal several times higher than that of a flat surface, E’fiat, known as the field enhance- 
ment factor 6 so that Elocaı = fE fiat. Such whiskers can occur in manufacturing or by 
damage to a surface on the micron or nanometre scale, giving a higher local electric field on 
the surface than expected. Field emission can give very high current densities compared to 
thermionic emission but it is more difficult to produce large emission areas, with high field 
enhancement factors. In particle accelerators, operating with high electric fields, this effect 
can be unwanted as the surfaces of RF cavities themselves can emit electrons which can be 
captured along with the beam in the strong RF fields. These will eventually drift off the 
beam trajectory and will deposit their energy in whatever they collide with [8]. Field emis- 
sion can also occur alongside photo-emission in photocathodes degrading the beam quality 
through unwanted parasitic emission. 

Once the electrons have been emitted it is necessary to remove them before they impact 
the anode so they can be further accelerated. This can be achieved by placing a hole in the 
aperture connected to a conducting beam tube such that the electrons can travel along this 
tube to other accelerator components. As particles with like charge repel each other the 
electron bunch will blow up (increase in emittance) between the cathode and the anode. 
Fortunately, the beam will also have a magnetic field due to the motion of the electrons. 
The force due to the electron’s electric and magnetic fields, known as the space-charge field, 
cancel each other completely when the beam is travelling at the speed of light c, and partially 
when the beam is travelling slower than c. In addition, the electrons become heavier due 
to relativity, hence the effects of space charge on electrons is much more significant at low 
energy below a few MeV, hence accelerating faster with higher electric fields can minimise 
the effect; this is covered in more detail in Chapter 7. If the current is well known and the 
beam is continuous in time, this can be compensated for by curving the cathode and anode 
to cancel the beam’s own space-charge field. This was studied by Pierce who developed the 
Pierce electrode geometry which is placed at an angle to the cathode to cancel the space- 
charge field. However, many electron sources (often called electron guns) are required to 
produce short pulses of electrons in a beam; in this case the beam should require magnetic 
focusing to compensate and minimise the beam blow-up. 
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FIGURE 3.4 a) Schematic of the ISIS Penning ion source b) A photograph of the ISIS Penning source. 
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3.2.2 Ion Sources 


As their name suggests, ion sources provide either positively- or negatively-charged ion 
species. They are typically composed of two parts: a plasma source (in a chamber) and an 
extraction system to remove the desired ions from that plasma. The gas can be ionised to 
create the plasma by either applying a large electric field which creates tunnel ionisation 
and pulls the positive and negatively charged particles in opposite directions, or by collision 
with an electron which knocks out a bound electron, known as impact ionisation. It is also 
possible to ionise a gas via electron capture, to create a negatively-charged ion. Once ionised, 
a DC accelerator can separate the electrons and the ions. The most common methods used 
in particle accelerators for generating positive ions are electron bombardment, plasmatrons, 
microwave, electron beam, laser and vacuum arc [9]. Common methods for producing H~ are 
surface plasma cold cathodes and multicusp sources. A common example of an ion source is 
the PIG (Penning Ionisation Gauge) proton source, within which is a small chamber (several 
millimetres across) that contains hydrogen gas fed in continuously at a known small rate, 
usually by means of a mass-flow controller. The gas volume has a flat cathode at each end 
and a cylindrical anode between, with around 2 kV between them. Electrons emitted from 
these (cold) cathodes take long, helical paths toward the anode due to an additional applied 
magnetic field applied across the electric field (hence these are cross-field devices), which 
then create ions via impact ionisation. The PIG source for the ISIS accelerator is shown in 
Fig 3.4. 


3.3 Radio-Frequency Acceleration 


The maximum accelerating field of an electrostatic accelerator is limited by the DC Kil- 
patrick criterion [10] (not to be confused with the RF Kilpatrick criterion given later in 
this chapter), an empirical formula devised in the 1950s by W.D. Kilpatrick; the maximum 
voltage V and gradient E satisfy the inequality 


1.7 x 107 


E? 
V E* exp ( E 


) < 1.8 x 10"; (3.8) 
this is an empirical fit, where V is given in V and E in V/m. It shows the accelerating field 
is dependent on the voltage across the gap between the anode and cathode, and is limited to 
around 3 MV/m for electrostatic accelerators. The maximum voltage is also limited by this 
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FIGURE 3.5 Schematic of a basic RF linac where the polarity of each electrode alternates along the 


linac and flips at the RF frequency such that the electron always sees an accelerating field. 


criterion for a given size of the accelerator. The charged cathode must be separated from 
any grounded potentials such that the field does not exceed 3 MV/m for large voltages, 
as given by the DC Kilpatrick criterion. This means the cathode must be held above the 
ground and all mechanical supports should be insulated and able to hold off the applied 
voltage, a limitation that further increases the size of the accelerator. For a 3 GeV machine 
the cathode would be at least 1 km above ground (likely more) in air and no building or 
structure could be closer than around 1 km away. This can be reduced to a few hundred 
metres by using an a pressurised or electronegative gas, such as sulphur hexafluoride, or 
vacuum to hold off some of the voltage, which can sustain a higher electric field, but the 
electron/ion path must be in vacuum. This would an unfeasible requirement, and in practice 
electrostatic accelerators are limited to cathode potentials less than a few tens of MV even 
when using a pressurised, electronegative gas. 

In order to reduce the size of accelerators and allow them to be placed horizontally at 
ground level it would be ideal to use several gaps in series, with the maximum potential 
constant along the length. However, as the energy gain is proportional to the difference in 
potential across the gap, each gap must be at a sequentially increasing potential, thereby 
negating any benefit of multiple gaps. One option to allow the use of two gaps is to use 
negative ions and then strip the electrons, making it a positive ion, to allow acceleration in 
the 2nd gap with the opposite potential difference to the first gap. Such an arrangement is 
known as a tandem Van de Graaff. 

In order to use multiple gaps without increasing the potential at each subsequent elec- 
trode we can instead vary the potential in time using a metal drift-tube to shield the particles 
when the field would be decelerating. Alternatively we can use gaps of alternating potential 
difference. Here a positive potential — that attracts a negatively-charged particle to it — can 
be switched to a negative potential when the particle passes, thereby repelling it and giving 
twice the voltage, as shown in Fig 3.5. The same trick can be used over many hundreds of 
gaps (or more) allowing the beam to be accelerated to an energy far greater than that given 
by the potential difference across each gap. As the field varies with time only bunches of 
charged particles that have a duration much less than the RF period (the time over which 
the voltage is varying) can be accelerated using this method. Typically the field is alternated 
at frequencies from tens to thousands of MHz, covering the same frequency band as radio 
transmissions; hence this is known as radio-frequency (RF) acceleration. 
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FIGURE 3.6 Schematic of a Wideroe linac, with the polarity switching every drift tube. 


3.3.1 The First RF Linacs 


The earliest RF accelerator was proposed by Gustav Ising in 1924, and was built by Rolf 
Widerøe in 1928; it was known as a drift-tube linear accelerator (or linac for short) [11]. Here 
the positive and negative terminals of an RF oscillator are connected to alternating metal 
drift-tubes (a hollow metal tube that the electric fields cannot penetrate into, such that the 
particles drift in a field-free region) such that every drift-tube has the opposite polarity to 
the drift-tube on either side, as shown in Fig 3.6. Charged particles are accelerated in the 
gap between each drift-tube. Widerge’s linac was tested on a single drift-tube with only 
two gaps. As the particles are accelerated they become faster and can travel further in one 
half RF period hence the gaps increase in length with each successive gap. As the fields 
oscillate in time the potential difference will change in time as the particles traverse the 
gap. For this reason for a given potential difference it is optimal to have the particles cross 
the gap in a finite time period when the field is maximum. However as the particles must be 
synchronous with the fields, arriving at each gap half an RF period after the previous gap, 
the drift-tubes need to be sufficiently long to shield the electric fields from the particles for 
enough time that they enter the next gap at the correct phase. Again the length of the drift 
tubes increase with particle velocity. Widerge’s original linac used a 1 MHz, 25 kV source 
to accelerate potassium ions up to 50 keV. The first multi-gap linac was built in 1931 by 
David Sloan and Ernest Lawrence which produced 1.25 MeV Hg? ions using an accelerating 
voltage of 42 kV across 30 gaps, at 10 MHz. 

We can generate large potential differences in RF accelerators by storing RF energy 
over a long period of time and releasing that energy in a shorter time when accelerating 
the particles. This is achieved by placing the accelerating gap inside a can made of a highly 
conducting metal which traps the RF fields inside, known as a cavity. The RF fields can 
then be coupled into the cavity using a small antenna inside it. At certain frequencies, which 
depends on the size and shape of the cavity, a perfect standing wave is created inside the 
cavity allowing the energy to be stored for a long time, a few thousand to a few million 
RF periods dependent on the conductivity of the walls and the coupling. This increases the 
potential difference across the gap for a given input power compared to the case without 
a cavity. In circular particle accelerators, where the same cavity can be used for multiple 
passes of the beam, a single gap cavity can be utilised, but for linear accelerators, in order 
to minimise the linac length, it is preferred to use multiple gap cavities. 

In 1945 Luis Alvarez devised a variant of the Widerge drift-tube linac (DTL) where 
several drift-tubes were placed inside a cavity [12]. In this case, the two ends of each drift- 
tube have opposite potentials and the potential varies along the drift-tube. This means that 
each gap has the same potential difference and hence the gaps now have to be spaced apart 
by a full RF period, meaning that the drift tube needs to be almost twice as long. While 
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this means that less of the cavity length is utilised for acceleration, the fact that it utilises a 
cavity to store the RF energy makes up for this. Alvarez together with Wolfgang Panofsky 
built a 32 MeV proton DTL operating at 200 MHz in 1947. Currently, Alvarez-type linacs 
are commonly used to accelerate protons between 50 MeV and 200 MeV. 


3.3.2 Disk-Loaded Cavities 


In 1933 Jesse Wakefield Beams* developed a method of synchronising the RF in successive 
cavities by using an artificial lumped-element transmission line with a wave velocity equal 
to the velocity of the charged particle to be accelerated, with a number of electrodes fed 
from this line which can accelerate that particle. However, the first electron linac was not 
produced until 1946. It is often misstated that the first electron linac was at Stanford, but in 
reality the first electron linac was developed by Donald William Fry at the Telecommunica- 
tion Research Establishment at Great Malvern in the UK, which was a 0.5 MeV corrugated 
waveguide linac using a 1 MW, 3 GHz magnetron [13], which was later upgraded to 4 MeV. 
The reason for the delay was the lack of sufficiently powerful RF sources. The most common 
RF sources prior to 1937 were magnetrons, similar to those used in microwave ovens today. 
First invented in 1910 by Harry Boot and John Randall, prior to 1939 magnetrons were 
not very powerful. In 1937, a new RF source known as the klystron was developed by the 
Varian brothers, Russell and Sigurd. In both devices the kinetic energy of an electron beam 
is transferred to an RF wave amplifying either a pre-injected signal or noise in the device. 
These devices were limited by the maximum RF power that could be generated. During the 
Second World War many of the world’s scientists turned their efforts to helping in the war 
effort, and many of these were tasked with developing longer range radar systems. During 
1939-1945 improved klystrons and magnetrons were rapidly developed that could provide 
far higher powers in the MW range than their predecessors which were limited to around a 
kW. From 1945 onwards many of the scientists and engineers working on radar went back 
to particle and nuclear physics and they brought these new RF sources with them allowing 
higher particle energies to be reached. 

In 1948, unaware of the work of Fry, Bill Hansen improved upon the electron linac design 
by placing a series of periodic disks inside an RF waveguide, forming a series of small cavities 
with a potential difference between disks [14], known as a disk-loaded cavity. Each has a 
small hole for the beam to travel through, and either this hole or other additional holes 
serve to transfer RF power from cavity to cavity. Each cavity has a slightly different phase 
of the RF field and the structure will behave like a transmission line with a phase and group 
velocity which can be altered by changing the coupling of the RF power through the holes 
in the disk. This device is still the most common type of particle accelerator today. The 
high RF powers available via the new klystrons and magnetrons allowed accelerating field 
gradients higher than those available with DC accelerators. The Hansen linac was able to 
accelerate electrons to 4.5 MeV, and by 1973 the Stanford Linear Accelerator Centre had 
developed a linac utilising a disk-loaded waveguide that accelerated electrons to 30 GeV. 


3.4 Confined Electromagnetic Fields 


To have efficient acceleration we must confine the wave in an RF cavity, also known as 
a resonator. A cavity confines the wave in all 3 directions, which allows a large stored 


*Jesse Wakefield Beams had probably the most appropriate name of any accelerator scientist. 
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energy to build up providing large electric fields, while a waveguide confines the wave in 2 
directions and allows a power flow in the 3rd direction, which is useful for transporting RF 
power from the generator or, if slowed down, is also useful for accelerating particles. Here 
we examine the use of conducting walls to confine the wave. Waveguides are hollow metallic 
pipes, normally with either rectangular or circular cross section for simplicity. Alternatively, 
we can instead use concentric cylinders known as coaxial lines to confine the wave. If the 
two ends are connected to a generator and a load respectively, then power will flow along 
the waveguide from the generator to the load. A cavity is similar to a waveguide except 
that both ends have metal walls covering them, causing the wave to reflect between the two 
ends forming a standing wave inside. 

Most waveguides used in accelerators are hollow pipes of rectangular cross section, hence 
we will begin in Cartesian co-ordinates for simplicity before moving onto cylindrical coor- 
dinates as most cavities are cylindrical. In Cartesian co-ordinates the wave equation is 

ro Cb A 1 8? 


Ox? a Oy? si a2 2 AP ee 


where © is either the electric or magnetic field in the longitudinal direction, z. If we assume 
the solution to this equation varies sinusoidally in the longitudinal direction and in time 
this can be reduced to ; i š 
o P a o P Lie w 
Ox? 3y? z 2 
the solution will be an interference pattern of reflected plane waves travelling at an angle 
with respect to the propagation direction down the guide. As the walls are parallel to the x 
and y planes, we expect the solution to have sinusoidal variations in the x and y directions. 
From Gauss’s law we know that the electric fields, and hence time-varying magnetic 
fields, must be zero within a conductor. Surface currents can cancel out the magnetic field 
parallel to the surface, and surface charges can cancel out electric fields perpendicular to 
the surface, but the other field components must be continuous on both sides of the surface 
leading to fields inside the conductor and hence losses due to the movement of charges. 
This leads to the boundary conditions that electric fields parallel to the surface, Ej, and 
magnetic fields perpendicular to the surface, H, should be zero on a perfectly-conducting 
boundary, 


d=0, (3.10) 


Ay 0: 
H, =0. (3.11) 


This implies that these field components must either be zero everywhere or have a variation 
with distance such that those field components are zero on the walls but finite elsewhere 
in the waveguide or cavity. Considering these boundary conditions for a waveguide with 
waveguide width a, waveguide height b, and with metal walls along the x=0 and y=0, we 
obtain the equations for the transverse variation in the longitudinally directed component 
of the electric, E,, and the magnetic, H,, fields, 


E, = Eo(z,t)sin(k,2) sin(kyy), (3.12) 


and 
H, = Ho(z,t) cos(k,x) cos(kyy), (3.13) 


where Eo and Ho are the maximum longitudinal electric and magnetic fields, and kz and ky 
are the transverse wavenumbers in the x and y direction respectively where kry = 27/Xz,y. 
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In order to meet the boundary conditions, it is necessary that the transverse wavenum- 
bers satisfy 

mr nT 

kz = i ky E Eri 

so that there is an integer number of half wavelengths between the walls; a is the waveguide 

width, b is the waveguide height, and m and n are arbitrary indices equal to the number of 

half wavelength variations along the width and height respectively. From the wave equation 
the wavenumbers should be given by the spatial variation as, 


(3.14) 


w2 


a Tk = kat ky + kz» (3.15) 
where w = 27f is the RF angular frequency, k = w/c is the free space wavenumber and 
k, = 27r /Az is the wavevector in the longitudinal direction. Often we combine k, and ky 
together into a transverse or cut-off wavenumber kz, 


ki = ka + ke. (3.16) 


If we assume the wave varies sinusoidally in the longitudinal direction, z, and in time, t, 
where longitudinal components point in the direction of the power flow, the electric fields 
in a waveguide can be given by 


E(z,t) = | E,(z,y) | ett-*=*) (3.17) 


where Ey, Ey and E, are the (complex) electric fields in the z, y and z directions, such 
that each field component may be out of phase with the others. 

For each combination of m and n we obtain a different orthogonal mode of the waveguide. 
In a homogeneous, linear, isotropic and stationary media with a smooth-walled waveguide of 
constant cross section either the electric or magnetic field in the propagation direction must 
be zero. It is convenient to split this into two subsets where we calculate the transverse 
fields from either the longitudinal electric or magnetic fields. Where we have a non-zero 
longitudinal electric field the magnetic fields are purely transverse, hence we call this a 
transverse magnetic (TM) mode. Conversely when we have a non-zero longitudinal magnetic 
field the electric fields are purely transverse hence we call this a transverse electric (TE) 
mode. There is a third class of mode where both longitudinal electric and magnetic fields are 
zero, which are called transverse electromagnetic (TEM) modes; however, these can only be 
supported where there are two electrically isolated conductors, such as a coaxial line where 
there is an outer and an inner cylinder. Hence we have a set of modes of the waveguide 
denoted TEmn and TM,,, and TEM. As E, would be parallel to the waveguide walls, m 
and n cannot be zero for a TM mode, while for TE modes either m or n can be zero but 
not both. 

The electromagnetic waves in a standing-wave cavity can be considered as a superposi- 
tion of forward- and backward-propagating waves, hence the electric fields are given by 


E = | E,(z,y) cia + gilwetkes)) (3.18) 
Ez (x, y) 
In a cavity k, can instead only take a finite number of values where the cavity length, L, is 


an integer number of half wavelengths. Here we provide a third index to define the mode, 
p, which is the number of half wavelengths along the cavity in the z direction. TM modes 
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can have p = 0 and still satisfy the boundary conditions, but TE modes require p > 1. The 
cavity mode is hence defined as TE/TM mnp; 


pr 
ka= T (3.19) 
The sum of the wavenumbers squared must still equal the square of the free space wavenum- 
ber, and hence each mode can only resonate at a single frequency given by Equation 3.15. 
For a waveguide, the mode will propagate if k > k, and hence k, has a real component. 
If k < ky and therefore k, is purely imaginary, the wave will decay exponentially and is said 
to be below cut-off. This implies a minimum frequency at which a mode can propagate, 
known as the cut-off frequency, we = krc. For frequencies below this, k, is purely imaginary 
and hence the fields decay exponentially in z. The cut-off frequency is proportional to 
the waveguide size, hence a low-frequency waveguide is much larger than a high-frequency 
waveguide. Each mode in the waveguide will have a different cut-off frequency; however, TE 
and TM modes with the same indices will have the same cut-off frequency in a rectangular 
waveguide (this is, however, not the case in other waveguide cross-sectional shapes). If a 
wave propagates in more than one mode, the wave will be distorted due to the different 
wavenumbers for each mode, hence it is preferred to propagate the RF power in a single 
mode. Conventionally, the waveguide width is defined as being larger than the height (i.e. 
a > b), hence the lowest frequency mode is the TEi9 mode and this is the mode typically 
chosen to transport the power from the RF source to the cavity, although other modes 
are sometimes used. In order to maximise the frequency band over which the waveguide is 
single-mode we set a = 2b so that the TEo; and the TE20 have the same cut-off frequency 
and the single moded bandwidth is maximised. The dispersion diagram (a plot of w against 
kz) is shown in Fig 3.7. As we will later see, this plot is useful for finding the frequencies of 
strongest interaction with a beam. 
Using Faraday’s law and Ampere-Maxwell’s law, it can be shown that 


E, = pzp (keV LE, +wuV x H,%), 
Hi = pip (keViHz — weV x Eô), (3.20) 


where k is the free-space wavevector (k = w/c); p = HrHo and € = €,€9 are the permeability 
and permittivity of the waveguide interior (often we have a vacuum and u = plo, € = €o). 
This means that once the longitudinal field components have been solved, the transverse 
field components can then be calculated from them. 

The transverse fields for a TM mode are hence 


E= SE iy, t) cos (=a) sin (=y) , 
kra a b 
E; = — ap olz, t) sin (Za) cos (y) , 
H, = an Eola t) sin (a) cos (Ey) ; 
Be — ae Bolt cos (Za) sin (“y), (3.21) 
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FIGURE 3.7 Dispersion diagram (w versus kz) for a waveguide with cross section 71.136 mm X 34 mm 
(known as WG10 waveguide) for the first five modes. Note that the TM and TE modes with the same 
indices have the same dispersion in rectangular waveguide. 


while for a TE mode they are 


Ey = “a5 Holst) cos (r) sin (Fo) f 
Ey = Aa Holst) sin (=a) cos (Fo) ; 
ihe 
A, = et Holz, t) sin (=a) cos (Eu) , 
ka a b 
k, 
H, = “ap Hole) cos (Za) sin (Fu) l (3.22) 


The fields in a TE10ọ mode are given by 


H, = Ho(z, t) cos(kzx) cos(kyy), 


Ez = 0, 
—1WET . T 
Ey = pa N sın (Z2) ; 
tk, _ [T 
H, = —z— Ho(z, t) sin (Zz) : 
kea a 
H, = 0, (3.23) 


where F, is zero everywhere. 

The fields of the first two modes (TE10 and TE20) in a waveguide where the width is 
twice the height (a = 2b) are shown in Fig 3.8 and Fig 3.9; the fields of the first TM mode 
(TM) are shown in Fig 3.10 and Fig 3.11. 
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FIGURE 3.8 Electric and magnetic field patterns of the T Eio and T E29 waveguide modes for a cross- 


section perpendicular to the propagation direction, where the arrows indicate the direction of the field 


vector. 
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FIGURE 3.9 Electric and magnetic field patterns of the TEj9 and TEg9 waveguide modes from a cross- 


section along the propagation direction and perpendicular to the y direction, where the arrows indicate the 


direction of the field vector. 
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FIGURE 3.10 Electric and magnetic field patterns of the TM11 waveguide modes for a cross-section 
perpendicular to the propagation direction, where the arrows indicate the direction of the field vector. 
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FIGURE 3.11 Electric and magnetic field patterns of the TM11 waveguide modes from a cross section 
along the propagation direction and perpendicular to the y direction, where the arrows indicate the direction 
of the field vector. 


Acceleration 45 


The ratio of the transverse fields Z = EF, /H, is known as the wave impedance, which 
should be real if there are no losses on the cavity walls. For a TM mode the wave impedance 


1S 
Bi JPA aA 
Zam = Fp = (£) -=D (3.24) 


where the constant Zo ~ 377 Q is known as the impedance of free space. For a TE mode 
the wave impedance is 


(3.25) 


2rE = FP iar 


The impedance is useful for calculating reflections from interfaces of different cross section 
and for the development of equivalent circuit models. 


Ey | Cue -z rz 


3.4.1 Phase and Group Velocity 


As the mode in the waveguide is made up of several plane waves reflecting from the walls and 
travelling at an angle to the direction of the mode’s propagation, the mode will travel slower 
than the speed of light. However within a pulse the peaks will appear to move at a different 
velocity, which can be faster than the speed of light. This does not violate causality as it 
only appears to move faster, the pulse and hence the information always travels slower than 
the speed of light. The angle at which the plane waves propagate with respect to the z axis is 
fixed for a given mode and frequency, such that the phase fronts from successive reflections 
are synchronous, and hence a standing wave is produced in the transverse directions such 
that the boundary conditions are maintained, shown in Fig 3.12. The distance travelled by 
the plane wave from one surface to the other and back must be an integer number of free 
space wavelengths (A = w/c) so that the wave returns with the same phase. For a waveguide 
of width a, the distance travelled, l, is related to the angle of propagation, 0, by 
2a 


cos 0 A (326) 


The mode has travelled along the waveguide in the longitudinal direction by a distance of 
only Asin ð, hence the mode’s signal (or group) velocity, which is the velocity component 
in the longitudinal direction, is given by 


Asin 6 
À 


Ug =C€ = csin ð. (3.27) 
If we have a pulse of RF of finite duration, the envelope of the pulse will travel at the 
group velocity, but the peaks of the wave inside the pulse will move at a different velocity, 
known as the phase velocity. If we imagine a plane wave propagating at an angle of 0 with 
respect to the z axis, as we have seen, the mode travels in the z direction more slowly; 
however, as the phase front extends parallel to the direction of propagation to the extents 
of the waveguide, the phase front seems to have travelled a larger distance, if we look at 
the distance a peak moves in the direction of propagation in a finite time interval, but in 
reality the peak at the later time is a different part of the pulse. By considering the phase 
front as in Fig 3.13 and considering the geometry, we can see that the distance the peak 
moves in one RF period is 

lphase = Asin 0 = Az, (3.28) 


hence the phase velocity, vp, is given as 


À C k w 
=C sind ke Be’ (320) 


Up 
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and that 
enone (3.30) 
The group velocity can also be given by 
Ow 
vg =a: (3.31) 


The instantaneous directional energy flux at a point in a waveguide is given by the 
Poynting vector, S, measured in W/m?, as 


S=ExH. (3.32) 


In a waveguide the Poynting vector points in the direction of propagation. In a cavity the 
real part of the Poynting vector is zero as there is no net power flow — the forward and 
backward components cancel; however, there is an imaginary component giving a reactive 
power back and forth which may have transverse as well as longitudinal components. Later 
in this chapter we will discuss travelling-wave structures which are a hybrid of a cavity and 
a waveguide, in which case the Poynting flux has both real and imaginary components. The 
power contained in the RF wave, Pa, is the integral of the time-averaged Poynting vector, 
Sav = |E x H*|/2, where * denotes the complex conjugate, 


1 a b 
P= F l Re|E x H*|dzdy. (3.33) 
2 Jo Jo 
For a rectangular waveguide this is 
E? 
Pay = ab, 3.34 
ieee (3.34) 


where Emax is the maximum electric field. The maximum power flow in a waveguide is 
limited by the peak electric field. For a 3 GHz TEi9 mode in a WG10 standard waveguide 
(a = 72.136 mm, b = 34.036 mm), assuming a maximum peak electric field of 3 MV/m (in 
air), the maximum power flow is 2.26 MW. The group velocity is also related to the power 
flow in a cavity by 
Pay 
Y= 
where U is the stored energy in a cavity of length L. 
In order to have an efficient accelerator we want the electromagnetic fields transported 
to the accelerating structure to remain in the accelerating structure, only decaying due to 
ohmic losses in the walls. While a waveguide can be used as an accelerating structure we 
would have to slow the group velocity down to prevent the power leaving the structure too 
quickly, while simultaneously reducing the phase velocity to be equal to the particle velocity, 
which requires the structure to be loaded with either a dielectric lining, a corrugated wall 
or a series of iris’ 


(3.35) 


3.4.2 Electromagnetic Fields in Cylindrical Cavities 


Cavities are typically cylindrical rather than rectangular. In a real accelerating cavity there 
will be beampipes with smooth transitions where they meet the cavity; however, it is useful 
to first understand the fields in a cavity of constant circular cross section but closed at 
both ends; this is known as a pillbox cavity. In cylindrical coordinates (¢, r, z), the wave 
equation is 


10 / ð 106 , w? 
. (2) t a gga oai (3.36) 
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FIGURE 3.12 Wave reflecting inside a waveguide showing wavefront coherence over multiple reflections. 


Wave fronts 


FIGURE 3.13 Group and phase velocities from propagation angles. 
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TABLE 3.1 The mt” root of the 
nt} Bessel function of the first kind. 
m\n 0 1 2 
1 2.405 3.832 5.136 
2 5.520 7.016 8.417 
3 8.654 10.173 11.620 


Separating variables and applying a periodic boundary condition to the azimuthal compo- 
nent, we find the solution to this equation is a radial varying function, Rm, which satisfies 
Bessel’s equation whose general solution is 


Rm = AıJmlkir) + AəNml(kir), (3.37) 


where Jm is the mn Bessel function of the first kind and Nm is the min Bessel function of 
the second kind. Since Rm must be well-behaved at r = 0, and Nm > —oo at r = 0, we set 
the constant A> = 0. For TM modes, FE, = 0 at the cavity radius, a, due to the boundary 
conditions, hence kya = Cmn where Cmn is the nt” root of the m*” Bessel function of the 
first kind. Considering the fields must be sinusoidal in ¢ and z, this leads to 


E, = EoJm ¢ n) cos(m@) cos (pr) exp(iwt). (3.38) 
a L 


The index m is the number of full-wave variations around ¢ and the index n is the number 
of half-wavelength variations across the cavity diameter. The roots are given in Table 3.1. 

As a cavity is fully enclosed by metal walls, the boundary conditions are only satisfied 
at discrete frequencies, as discussed previously. The resonant frequency of a cavity mode is 


given by : 
(2) =e ante a (E) + (==) . (3.39) 


C L a 


The transverse components of the fields can again be found using Equation 3.20. This leads 
to the field components of a TMmnp mode being given by 
Cam Z . 
E, = EoJm | r-— ] cos (mo) cos (pn) exp (iwt) , 
a 


_ kz i Grin: ; Z 4 
Erz = Enga (2) cos (m@) sin (pr=) exp (iwt) , 


kz mn . . . 
Eg = Eo tS ($ ) sin (m@) sin (pn=) exp (iwt) , 
r a 


ke 
H,. = Pep bod (=) sin (mø) cos (pr) exp (iwt) , 
-WE p (4m ) cos Aom ; 
Hy = i; EoJm (« 7 ) cos (mo) cos (pn=) exp (iwt) , 
H, =0, (3.40) 


where Jh (x) = dJmn(x)/da. 

In order to accelerate a charged particle beam we need to have an electric field in the 
direction of the beam’s motion; hence, if the beam travels in the z direction, then only a 
TM mode can accelerate the beam, although some complex structures can distort the fields 
to give a TE mode an E, component. Depending on the length of the cavity the lowest 
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FIGURE 3.14 Electric and magnetic fields of a TMo19 mode in a pillbox cavity. 


resonant frequency will either be the TMoi9 or the TE,;; mode. In a simple cylindrical 
cavity used for accelerating relativistic particles, we normally have a short cavity length, 
hence the TMo10 will have the lowest resonant frequency; however, some low-energy proton 
and ion accelerators utilise a TE mode instead. The fields of a TMo19 mode are shown in 
Fig 3.14 and are given by the equation 


2.405 
E; ~ EoJo —— exp(—iwt), 
iEo . (2.405r 
Ay ~ 
$ Zo Ji z exp(—iwt), 
E, = Eg = H, = H, = 0, (3.41) 


where Zo = 377 Q is the impedance of free space, and Jj(x) = —Ji(z). 

It should be noted that while Hg is zero in the cavity centre it doesn’t mean that the 
beam doesn’t experience these fields, as the beam will have a finite radius and hence the 
particles on the outside of the bunch will experience these transverse fields as well as the 
accelerating field. The effect of this is covered in detail in Chapter 5. 


3.4.3 Coaxial Lines 


If we have two electrically isolated conductors then the waveguide can also support TEM 
modes as well as TE and TM modes. TEM modes have no longitudinal field components, 
H, = E, = 0, and the electric field parallel to surfaces and magnetic fields perpendicular 
to surfaces are also zero, i.e. H1 = Ey = 0. As a consequence, the fields only have variation 
to conform to the surfaces, and hence the transverse wavenumber, k1, is zero. This means 
that the wave travels longitudinally, and hence propagates at the speed of light in the filling 
medium; k = k, and hence has no cut-off frequency. Common waveguides that support TEM 
modes are parallel plates (two parallel conducting plates separated by some distance), and 
coaxial lines (two concentric cylinders where the fields propagate in between the inner and 
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Electric field Magnetic field 


FIGURE 3.15 The electric and magnetic fields of a TEM mode in a coaxial line. 


outer conductor). For accelerators operating at low RF frequencies below around 400 MHz, 
coaxial lines are commonly used to keep the waveguide transverse size down, due to the 
lack of a cut-off frequency for TEM modes, where a TE mode would require very large 
dimensions to operate above cut-off. In a coaxial line the field components in cylindrical 
components are 


E,. = Edat) 
F 
Eo(z,t) 
Pia SE 42 
$ Zar (3.42) 


The fields of a TEM mode in a coaxial line are shown in Fig 3.15. Coaxial lines can also 
support TE and TM modes, hence it is usual to keep the outer conductor radius within 
limits to operate below the cut-off of the TE11 mode. The transverse wavenumber for the 
TE,, in a coaxial line of outer conductor radius b and inner conductor radius a, is given 
approximately by r 
BINTI (3.43) 

In the limit where the a tends towards b, the cut-off frequency of the TEM mode is 1.8 
times lower than in a circular waveguide of radius b. 

In a coaxial cavity, with metal walls at both ends connecting the inner and outer con- 
ductor, there are an integer number of half wavelengths along the line, and hence the mode 
is defined as a TE Moop mode. 


3.4.4 Walls with Finite Conductivity 


Real waveguides and cavities have walls with a finite conductivity and hence work is done to 
shield the fields inside the metallic walls. Most RF structures are made from good conductors 
so the charges can redistribute to keep the fields similar to those with a perfect conducting 
boundary with the electric fields only penetrating a short distance into the conductor known 
as the skin depth. The fields will decay exponentially between the surface and the skin depth. 
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= T (3.44) 


where u is the permeability of the conductor (where for most RF materials u = uo) and ø is 
the electrical conductivity of the conductor. As there is an electric field inside the conductor 
a current is induced in it, given by Ohm’s law J = cE, where J is the current density. This 
in turn leads to a power loss as the current is being driven through a resistance. This surface 
resistance, Rsurf, is given by 


The skin depth, 6, is given by 


1 
| ae ee (3.45) 
oå 


Annealed or oxygen-free high conductivity copper, which is a common material for the 
construction of RF cavities, has a conductivity of around 5.8 x 10" S/m at room temperature, 
meaning it has a surface resistance of 14.3 mQ and a skin depth of 1.2 um at 3 GHz. The 
surface current is proportional to the magnetic field in the conductor, so that the RF power 
loss, P., over a surface, S, is given by 


1 
P. = Rous f |H|?dS. (3.46) 
2 S 


This power is lost directly from the RF field, and in the case of a cavity, reduces the stored 
energy in the cavity. This power is converted to heat causing the cavity temperature to rise. 
The RF power loss can be of the order of a few to hundreds of kW for normal conducting 
cavities, which can raise the cavity temperature to dangerous levels if the cavity isn’t suf- 
ficiently cooled with circulating water. As the role of the cavity is to store electromagnetic 
energy, it is desirable to reduce the losses while maximising the stored energy. This leads 
to the ohmic quality factor, sometimes called the intrinsic Q factor, Qo, of a cavity, which 
is proportional to the ratio of stored energy, U, to ohmic losses, 


Qo = = (3.47) 


The higher the Q factor of a cavity, the more energy it can store for a given RF input power. 
As we will later see, the Q factor is also proportional to the filling time of the cavity and 
inversely proportional to the cavity bandwidth. A copper cavity will have a Qo ~ 104 at 
3 GHz while a superconducting cavity will have Qo ~ 109-10!" depending on its operating 
frequency and operating temperature. In order to compare cavity geometries it is useful to 
define the geometry factor, G, which is independent of the cavity wall material and is 


G = ReutQo- (3.48) 


As an example the superconducting TESLA cavity, operating at 1.3 GHz, has a geometry 
factor of 250 Q, providing a Qo ~ 2.5 x 10!° for a surface resistance of 10 nQ [15]. In the 
case of a waveguide, as an electromagnetic wave propagates, the energy density of the wave 
increases as the group velocity of the wave decreases. The losses in a waveguide are depen- 
dent on cross-sectional area, waveguide length, group velocity of the wave, the ratio of the 
operating frequency to the cut-off frequency of the waveguide, and the waveguide conduc- 
tivity. As the power lost is relative to the incident power, the attenuation is exponential. 
The attenuation in a waveguide can be represented as an imaginary wavenumber. The loss 
coefficient, a, is the imaginary part of the axial wavenumber, where 8, is the real part of 
the axial wavenumber and k; = pz — ia. The transmitted power, Pr, is then given by 


Pp = Poe, (3.49) 
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where z is the length of the guide and Pp is the input power. Hence the power lost due to 
ohmic heating, Pr, is 


Py, = Po(1 — e°). (3.50) 
Differentiating this equation with respect to z and rearranging gives an equation for a; 
1 OP, 
= ————. 51 
“OP dz en 


3.5 Accelerating Modes in Cavities 


The main purpose of an RF cavity is to accelerate the beam either to increase the beam’s 
energy or to replace energy lost due to radiation and other processes. The voltage (potential 
difference), V, between two points A and B is 


B 
v=] E,(z)dz; (3.52) 


however, as the electric field varies in time so will the voltage. 


B 
vo= f Eee" de, (3.53) 


where w is the cavity’s resonant frequency. In the case of an accelerating cavity we want to 
calculate the energy gained by the charged particles. However, the particles cannot travel 
faster than the speed of light, c, hence it will take a finite time to traverse the cavity’s 
accelerating gap; the electric field will change in strength, meaning the particle will not 
experience the full voltage of the cavity. Instead we must take into account the particle’s 
position with time, given by z = vt where v is the particle velocity (for convenience we often 
use the fractional velocity, 8 = v/c). The instantaneous accelerating voltage experienced by 
the beam, Vace, is then 


B 
Vace = | E,(z)e7”?/"dz. (3.54) 
A 


The ratio of the voltage seen by a particle travelling with finite velocity and the voltage 
seen by a particle travelling with an infinite velocity is known as the transit-time factor, T, 
\Vace| 


T=. (3.55) 


It is useful to also express the average accelerating field experienced by the beam, Eacc, 
over a cavity of length, L, known as the cavity accelerating gradient (or gradient for short) 


Vace 
Face = ——. 3.56 
z (3.56) 
The gradient of a pulsed normal conducting cavity can be up to ~100 MV/m for 12 GHz 
cavities for 200 ns long RF pulse durations, while for 1.3 GHz superconducting cavities it 
can be up to 35 MV/m*. If we take a pillbox cavity (i.e. a hollow cylinder) such that there 


*The record is 52 MV/m in a single-cell cavity but this cannot be achieved reliably or in multicell 
cavities. 
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FIGURE 3.16 Transit-time factor as a function of the gap length in an RF cavity. 


is no longitudinal variation of the fields (p = 0), then the transit-time factor is given by 
sin (32) 
T = —_—. (3.57) 


where g is the accelerating gap, which is shorter than the cavity length for multi-cell cavities 
due to the finite wall thickness. A plot of this equation is shown in Fig 3.16; while it would 
suggest that we get a larger gradient if we have a chain of very short cavities, in practice this 
is not the optimum configuration as there must be a finite wall thickness between cavities. 
If we have more cavities per unit length we also have more walls, and hence more wasted 
space. More cavities also means more couplers, or if we couple the cavities together, we must 
synchronise the fields with the beam. In practice the optimum cavity length for a single cell 
standing wave cavity is roughly given by 

TBC 


Lopt S (3.58) 
where the wall thickness is as short as possible. For thin walls, where g ~ L, the transit 
time factor is equal to 2/7 for a pillbox cavity. 

The cavity voltage seen by the beam will vary sinusoidally with the beam arrival phase. 
The maximum voltage of the cavity is often higher than the operating accelerating voltage 
as we often chose not to inject the beam at a phase corresponding to the peak voltage, as 
will be discussed in Chapter 5 when we look at beam stability. The ideal cavity should give 
the maximum voltage for a given dissipated (and hence supplied) RF power. To relate the 
dissipated power to an accelerating voltage, we use the cavity shunt impedance, Rs circuit; 


defined as Vael 
Rs circuit _ QP, ? (3.59) 
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this is equivalent to the power dissipated across a resistor with an AC voltage applied across 
it, with the factor of 2 in the denominator due to the peak voltage Vace and RMS power P, 
being used. Most accelerator physicists use an alternative definition — due to the fact that 
the particle bunch does not see the sinusoidal variation of the voltage — of 


(3.60) 


For linear accelerators it is often more useful to state the shunt impedance per unit 
length, rs. 


= Vercel 
r= f 
PL 

The CLIC-G accelerating structure, which has an operating frequency of 11.9942 GHz, 
has a shunt impedance per unit length of 92 MOQ/m [16]. The shunt impedance for a cavity 
which has all dimensions scaled to a different frequency will have the shunt impedance per 
cell scaled œ 1/y/f; however, as a higher-frequency cavity will have more cells per unit length 
hence, the shunt impedance per unit length scales x yf, hence allowing higher-frequency 
cavities to reach higher gradients for a given input power. However, if you doubled the 
frequency the aperture would halve for a scaled structure. Normally the aperture of a linac 
is constrained to a minimum aperture for beam stability and losses, hence when going up 
in frequency, the ratio of the aperture radius to the wavelength (a/A) increases reducing 
the shunt impedance. Fig 3.18 shows the shunt impedance per unit length as a function of 
aperture for several different frequencies, where it can be seen that for any two frequencies 
there is a maximum aperture where the higher frequency provides a higher shunt impedance 
per unit length, and above that aperture the lower frequency is better. 

As the shunt impedance per unit length (as well as the maximum gradient as we will 
see later) is strongly frequency dependent, cavities are grouped by their resonant frequency 
using the IEEE RADAR RF bands (loosely based on the old NATO bands). L-band (long 
wave) goes from 1-2 GHz, S-band (short wave) is 2-4 GHz, C-band* from 4-8 GHz, and 
X-band (short for ‘crosshair’ as it was used for fire control in World War IT) from 8-12 GHz. 
In each band there are generally two commonly utilised frequencies — either European or US 
in origin- the difference being whether the wavelength is an integer number of millimetres 
(European) or integer fractions of an inch (US); for example, X-band structures are either 
11.9942 GHz (European) or 11.424 GHz (US). Although many European accelerators now 
use US frequencies and vice versa. 

If a higher shunt impedance is required we can add what are known as nose cones, as 
shown in Fig 3.17. These are cones around the iris which reduce the accelerating gap, and 
hence increase the transit time factor, without increasing the magnetic field at the equator, 
hence the shunt impedance increases. However the peak electric field on the tips of the nose 
cones increases as the nose cones increase in length meaning this technique is not typically 
used for very high gradient accelerators. The addition of optimised nose cones will improve 
the impedance by around 10%. 

It is also useful to define the geometric shunt impedance R/Q, which like the geometry 
constant is independent of the cavity size and material. This impedance also relates the 
induced cavity voltage to the driving beam current and is a measure of the coupling between 
the fields in the cavity to the beam. 


(3.61) 


R Vace . 


*The ‘C’ in C-band stands variously for ‘commercial’, ‘communication’, or ‘compromise’. 
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FIGURE 3.17 An RF cavity with nose cones to decrease the gap size while keeping a large cavity volume 
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FIGURE 3.18 Shunt impedance per unit length as a function of aperture radius for a disk-loaded cavity 
for 3, 6, 9 and 12 GHz resonant frequencies where the disk thickness is 0.08 À. 
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Each mode in the cavity has a geometric shunt impedance relating the coupling of this mode 
to the beam. Modes with high geometric shunt impedance can be stimulated within the 
cavity as a bunch of charged particles traverse the cavity. These unwanted cavity harmonics 
can lead to detrimental effects on subsequent bunches. Since the effect is induced in the 
wake of the bunch this type of interaction is known as a wakefield. The amount of energy 
transferred is proportional to the cavity shunt impedance and inversely proportional to the 
cavity size. 


3.5.1 Cavity Equivalent Circuit 


An RF cavity can be modelled as an RLC series or parallel circuit, which makes calculations 
of dynamic behaviour or mode coupling much simpler. The resistance of the circuit is the 
shunt impedance of the cavity, R,. The capacitance and inductance are calculated from the 
cavity resonant frequency, wo, and the geometric shunt impedance R/Q using 


1 L 2 
“= JEC TENE JC’ (3.63) 


the Q factor can then be calculated using 


IC 
Q = Rs circuit T (3.64) 


3.5.2 Coupling Power into an RF Structure 


To connect the RF power supply to the cavity we must construct an antenna that will 
radiate power into the cavity (see discussion of antenna radiation in Chapter 6); this avoids 
the power being reflected back up the waveguide. This is normally just a waveguide or 
coaxial line connected to the cavity via a small hole in the beam pipe or the cavity walls, 
known as an input or power coupler, which will be discussed later. The strength of the 
coupling can be represented by defining an external Q factor, Qe, which relates the stored 
energy in the cavity to the power that would flow into the coupler if there is no RF power 
being supplied to the cavity, Pe; this is given as 


(3.65) 


It is convenient to add the external power lost to the coupler with the input power turned 
off, P, to the cavity ohmic losses, P, to give the total losses with the RF supply off P,. 
Since P, = Pe + P. then we can also define a quality factor combining all losses known as 
the loaded Q factor, Qz, where for a cavity with a single coupler, 


1 1 ma 1 
QL Qo Qe 
It is also useful to define the coupling factor, 8, which is the ratio of losses through the 
coupler to the ohmic losses in the cavity walls 


P, e Qo 
B= =. 3.67 
Pe Qe oe 
The cavity will have a finite bandwidth over which power is coupled into the cavity. The 
impedance of the cavity can then be solved from the equivalent circuit as a function of 


(3.66) 
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frequency w for 


Z= Rs circuit : (3.68) 
LAO (2 = æ.) 
This gives a full width half maximum bandwidth in Z of 2Qz/w. 

At frequencies outside of this bandwidth all the power will be reflected. When we have 
an RF pulse the rising and falling edges of the pulse will contain a wide range of frequencies, 
some of which will fall outside the band and will be reflected. For a square pulse, almost all 
of the power at the rising edge will be outside the band and all of the power will initially 
be reflected, but over time the bandwidth will reduce, reducing reflections and increasing 
the power coupled into the cavity. For slower rise times and larger cavity bandwidths the 
reflections are reduced. To model this, we can consider an equivalent circuit. When RF 
power is supplied to the cavity there will be a large impedance mismatch between the 
coupler, which will have an impedance of a few tens of ohms, as it is required to have a 
high power flow for transport, to the cavity which will have an impedance of several MQ in 
order to reach high gradients with minimal power. This means that at the interface between 
the coupler and the cavity there will be a large reflection in anti-phase to the supplied RF 
power. This reflected signal from the interface will interfere with the power leaking back 
into the coupler from the cavity which will be in phase with the supplied RF power. The 
total power flowing back up the coupler, P,, when driving the cavity on resonance, will be 


equal to > 
P, = (ve 2) (3.69) 


where Py is the forward power from the RF source. The reflection from the interface between 
the cavity and coupler due to the mismatch will be slightly less than 100% in reality. We 
will refer to the total reverse power going back up the waveguide as the reflected power P,, 
the power reflected from the interface between the cavity and coupler when the cavity is 
empty as the interface reflection, P;, and the power leaking back up the coupler from the 
stored energy as the emitted power, P.. The change in stored energy over time in an RF 
cavity without beam can be obtained by summing the power flowing into and out of the 


system as 
2 
dU wU wU 
— = P; — P; . 3.70 
q f (v PH z) Qo (3.70) 


Expanding the brackets gives 


du [4P pat i to 
=o. Ww (G- ! ae (3.71) 


and inserting the definition of loaded Q factor into this equation gives us 


dU APrwU wU 
=4/ : 3.72 
dt Qe QL ( ) 


We can hence find the steady-state stored energy, Uo, when the stored energy no longer 
varies with time (dU/dt = 0), by solving the quadratic equation for VU, 


7 4P;Q7 APB Qo 
Co ee (3.73) 


Uo 
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and we can also solve the time dependence of the stored energy, assuming the initial energy 
is zero, by solving the first-order nonlinear ordinary differential equation as 


2 
U = Uo (1 - eee) (3.74) 


It can be seen that the stored energy increases with time as the cavity fills with RF energy, 
converging to Up. The time constant for the filling, 7, is 


WwW 


~ Qn 


which is inversely proportional to the loaded Q factor rather than the ohmic Q factor. 
Having solved for the stored energy we can return to solving the reflected power; inserting 
Equation 3.73 into Equation 3.69 we obtain the steady-state reflected power 


2 
P. = Py h = a) (3.76) 


T 


(3.75) 


Qe 


and inserting the definition of the coupling factor we obtain 


2 
z 2 
P, = P; (: ; 2.) (3.77) 


It can be seen that when 6 = 1 the reflected power (flowing back up the coupler) is zero and 
the cavity is said to be critically-coupled. This can be interpreted as the reflections from the 
interface — due to the impedance mismatch between the cavity and the waveguide — exactly 
cancelling out the power emitted from the cavity into the waveguide as they will have equal 
magnitude but will be 180° out of phase. 8 = 1 when the ohmic and external Q factors, and 
hence the external coupler and ohmic losses, are equal. When 8 > 1 the ohmic Q factor is 
greater than the external Q factor and hence the coupler is said to be over-coupled; when 
8 < 1 it is said to be under-coupled. This can also be rearranged to find the coupling factor 
by measuring the steady-state reflections from a cavity to give 


1= /P,/P; 


with the upper sign used if 8 > 1 and the lower sign used if 6 < 1. Often \/P,/Py is referred 
to as the input port reflection coefficient, S11, which is the first element in the scattering 
matrix of reflected and transmitted waves from a multiport RF network [17]. By inserting 
Equation 3.74 into Equation 3.69 we can also solve for the reflected power for the case of 
the time-dependent reflections 


2 
P, = P; i a (1 - | (3.79) 


The first term represents the interface reflection and the second term the emitted power from 
the cavity. For the case of a critically- or under-coupled cavity, the reflections are initially 
close to 100% as there is no stored energy in the cavity to cancel the reflections at the 
interface. As the stored energy builds up in the cavity, so does the power emitted back down 
the coupler from the cavity, cancelling out some of the power reflected at the interface and 
reducing the power flowing back up the coupler. For a critically-coupled cavity the reflected 
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FIGURE 3.19 Transient reflected power for a square wave input pulse, where tpulse = 67 for 8 =0.5, 
1 and 2. 


power tends to zero over the filling time of the cavity, while for an under-coupled case they 
tend to a finite value. For an over-coupled cavity the behaviour is initially identical but 
soon the emitted power grows larger than the interface reflection. Hence the superposition 
of both reverse signals causes the reflected power to reduce to zero and then start increasing 
again to a finite value, as first the interface reflection dominates the reflected signal, then 
the emitted power. As these two signals have a 180° phase difference, the phase of the 
reflected signal also changes by 180° as it crosses zero. 

When the RF is switched off suddenly, Py becomes zero, hence it no longer cancels 
the emitted power and the reflected power will again spike with the peak reflected power 
directly after the RF pulse is switched off given by 


2: 
U 2 
a - =P; (25) (3.80) 


with an over-coupled cavity creating a reflected power spike up to four times the power of 
the initial forward RF power, a critically-coupled cavity reflected power spike the same size 
as the forward power and the under-coupled cavity with a smaller spike than the forward 
power. The reflected signals from a square envelope pulse, of duration tpulse, for each case 
is shown in Fig 3.19. The stored energy in the cavity will decrease exponentially with the 
time constant of the cavity 


U = Ue“, (3.81) 
Hence the stored energy will vary with time as 
4P;Q? 
U = Upe Wt trutse)/Qn — Oa oi e7“ lt- tpuise)/Qr, (3.82) 


The cavity voltage can then be obtained from the R/Q of the cavity. When the RF is turned 
off we can set the forward power to zero in Equation 3.69 but maintaining the same stored 
energy at the moment the RF is turned off; this will then decay exponentially. This yields 


2 
ace E a aes - 
r f 1+8 e ; ` 
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FIGURE 3.20 The reflected signal as a function of frequency on a polar plot for G = 0.5, 1 and 2. 


where Py is the power before the RF is turned off at time tpxise. If the RF drive frequency, 
w, is different than the cavity resonant frequency, wo, the steady-state reflected power can 
be given by [18] 


2 
1— B—iQod 
P, = Ps | ——————— 3.84 
(85) oe 
where ô is given by 
(22s (3.85) 
Wo W 


Plotting the reflected signal as a function of frequency on a polar plot will give a circle which 
will not enclose the origin if under-coupled, will cut through the origin if critically-coupled 
and will enclose the origin if over-coupled; this is shown in Fig 3.20. This allows the coupling 
to be measured from the reflected power. 


3.6 Gradient Limits 


The maximum gradient in a normal conducting structure is often limited by a number of 
phenomena: 


e RF breakdown; 
e RF heating; 
e RF source power limits or operating cost. 


RF breakdown is where the high electric fields cause some of the walls to be vaporised, and 
then ionised causing a plasma to form inside the cavity. RF heating is where the power lost in 
the cavity walls causes the temperature of the cavity walls to increase causing deformation 
and stresses which can affect normal operation. The RF sources are also limited by RF 
breakdown and RF heating and this leads to a limited RF power, which in turn limits the 
cavity voltage. Using high RF powers also implies a large electricity bill which can also 
be a limiting factor. If limits of the RF power supply are ignored for now, the physical 
limits of the cavity gradient are RF breakdown which is dependent on the peak surface 
electric, Epk, and magnetic, Bpk, fields, and RF heating which is related to the surface 
magnetic field. Hence two important criteria for cavity design are the ratios of the surface 
fields to the gradient Epk/FEacc, and Bpk/Eace. Typically values are Epk/Eace ~ 2 — 4 and 
Bpk /Eace ~ 4 mT/(MV/m). 
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3.6.1 RF Breakdown 


RF breakdown is where a plasma discharge within the accelerating structure grows to an 
extent where RF operation is not possible. The discharge causes an impedance mismatch 
which causes the power to be reflected back up the coupler, and absorbs the RF energy 
inside the cavity, hence stopping the cavity operation. Repeated breakdowns can also cause 
permanent damage to the structure. RF breakdown — also known as a vacuum arc — requires 
a gas to ionise but the cavity is initially uder vacuum. This process is initiated by field 
emission; if the heat created by the current flow and the RF heating becomes large enough 
there will be vaporisation of material. This gas can then be ionised by the emitted electrons 
leading to plasma formation. The plasma causes more material vaporisation leading to 
a growth of the plasma into a runaway current, known as an arc discharge. In 1957 W.D. 
Kilpatrick devised an empirical formula for the maximum surface electric field that could be 
sustained before breakdown. This was then reformulated to include a frequency dependence 
by Boyd [19] to the more common RF Kilpatrick limit 


8.5 
f =1.64E%, exp (-=) (3.86) 
where the frequency f is in MHz and the electric field Ep, is in MV/m. For a 3 GHz cavity 
the Kilpatrick field limit is 47 MV/m, and rises to 90 MV/m at 12 GHz; this is shown in 
Fig 3.21. This is not the gradient but the maximum electric field on the surface which is 
typically twice as large as the gradient. As cavity manufacture and preparation has been 
improved this limit is now regularly exceeded by a factor of 2, with the CLIC structures 
demonstrating 100 MV/m gradient (250 MV/m peak surface field) for a 200 ns pulse at 12 
GHz [16]. Breakdown is statistical in nature, as the location and time of breakdowns cannot 
be predicted but rather follows a probability that increases with electric field; so rather than 
being a hard limit, it is common to refer to the breakdown rate (BDR), given in breakdowns 
per pulse per metre, at a given field level. The breakdown rate is also dependent on the RF 
pulse duration, as well as the surface electric field and the cavity frequency. Typically, RF 
cavities in linacs will aim to operate with less than one breakdown per million pulses per 
metre to minimise structure damage and disruption to the beam being accelerated. As we 
saw previously, field emission can be increased on sharp tips, which can serve as breakdown 
sites, although this is not the only mechanism proposed as the cause of increased field 
emission. As a cavity is often manufactured with a number of sites which have a higher 
probability of breakdown it is necessary to condition the cavity. This consists of increasing 
the RF power very slowly over a number of hours, days or weeks, keeping the breakdown rate 
below a preset level. If the field is close to the cavities current breakdown limit the breakdown 
rate will decrease over millions of pulses, allowing the field to be increased. Traditionally 
this is considered to be due to a number of semi-controlled RF breakdowns, this causes 
vaporisation of the field emitters just above the breakdown threshold causing a minimum 
amount of damage. However, this can cause the material to be sputtered elsewhere, creating 
more field emitters, hence there is a limit to the gains from conditioning. More recently it 
has been suggested that the conditioning process is dependant on the number of RF pulses 
rather than the number of breakdowns which may condradict the traditional explanation 
[20]. After conditioning, the breakdown rate will scale with electric field and pulse duration, 
tpulse, as 

BDR « E°°t? (3.87) 


pulse* 
Hence the BDR increases very sharply with increasing field producing something close to a 
hard limit [21]. 
The causes of the field emitters are not known but several theories exist [22]. One such 
theory is the electromagnetic field applies a stress to the cavity surface which can give rise to 
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FIGURE 3.21 Maximum surface electric field versus RF frequency from the Kilpatrick criterion. 


sharp tips if the stress is applied near a defect under the cavity surface such as a dislocation 
in the copper atomic lattice [23] and conditioning is a form of work hardening of the surface. 
As such, the probability of a field emitter appearing, and hence causing a breakdown, would 
depend on the electric strength and the material properties. The suggested scaling of the 
BDR with the electric field using the stress is 


BDR «x e(©(8sBace)"AV/ksT) (3.88) 


where AV is the relaxation volume of the defect, kg is the Boltzmann constant, and T is the 
temperature of the defect. More recently CERN [21] has shown good empirical agreement 
between the BDR of a structure and the peak value on the surface of a modified form of 
the Poynting vector, known as Se. 


Se = Re[S] + g-Im|[S] (3.89) 


where ge is a weighting factor due to the different effects of active and reactive power, 
which is 0.15 to 0.25 depending on the local field enhancement factor, typically taken as 
1/6. CERN suggests that for a 12 GHz RF pulse with 200 ns duration, the breakdown rate 
will be 1 breakdown per million pulses, per metre for an Se of 5 MW/mm?. The BDR will 
scale with S, and pulse duration as 

BDR œ Sit (3.90) 


pulse* 


3.6.2 Multipactor 


Another common cause of electron discharge is multipactor, where the number of free elec- 
trons in the cavity vacuum undergoes an exponential growth in time. When an electron 
strikes a surface it can be absorbed, reflected (elastically backscattered), re-diffused, or cre- 
ate secondary electrons [24]. There may be more than one secondary electron emitted for 
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FIGURE 3.22 Secondary emission yield, including true secondaries, 64, re-diffused, 6,, and elastically 
backscattered electrons, Ôe, as a function of primary electron impact energy for copper [24]. 


each primary electron impacting the surface depending on the primary electron’s impact 
energy and impact angle. The average number of secondaries per primary is known as the 
secondary emission yield, and this is shown as a function of impact energy for copper in 
Fig 3.22. The process is statistical but the average number of secondary electrons per pri- 
mary electrons, 6, for most metals and ceramics is greater than one for primary impact 
energies from a few tens of eV up to a few keV, and less than one for other impact energies. 
These secondaries will experience a force from the RF fields causing it to move from the 
impact location. 

For multipactor to occur, the secondaries created must return to the surface at the 
correct impact energy over many RF cycles, requiring secondaries to return to the same 
impact site, at the same phase (although the electron could oscillate between two fixed 
impact sites, known as two-point multipactor) [25]. This resonance condition will only be 
met at discrete RF field amplitudes, but when the conditions are met, any stray electrons 
will cause an exponential growth in the number of secondaries causing RF heating of the 
surface and absorbing RF power. The number of particles N after a number of impacts, 
Nimpacts is given by 


N (Nimpacts) = No ae ; (3.91) 


where No is the number of initial electrons, and (6) is the average number of secondary elec- 
trons produced per primary electron impact. Multipactor typically happens in low electric 
fields either at low cavity voltages, or in locations where the electric field is lower, in order 
to have the electrons impact the surface at an energy likely to produce more secondaries. 
A common multipacting trajectory is where electrons make semi-circular cyclotron orbits 
around high magnetic fields, in the low electric field region, giving a resonant condition 


B=f-, (3.92) 
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where me is the electron mass. This type of multipactor was a limiting factor in the original 
superconducting cavities, but this was avoided in later cavities by making the cavity equator 
elliptical such that the electrons strike the surface at different angles depending on the orbit 
radii, causing the secondaries to move in successive orbits towards the centre where the 
electric field is zero and electrons cannot be accelerated to a sufficient energy to create new 
secondary electrons [26]. 


3.6.3 RF Heating 


At high RF powers, RF heating can be a major issue. Typically, normal conducting RF 
cavities have their temperature regulated via cooled, turbulent water flowing at high mass 
flow rates through metal pipes joined to the cavity. For a sufficiently high mass flow rate, 
the pipes and the cavity surfaces in contact with them can be held at a fixed temperature, 
although there are limits to flow rate due to cavitation. However, there will always be a 
temperature gradient between the RF surface where the heat is applied in the skin depth, 
and the cavity surfaces in contact with the pipes. For a cylinder of internal diameter, d, 
thickness, t, and length, L, with heat flow, Q, applied to the inner diameter and the outer 
surface held at a fixed temperature, the temperature difference between the inner and outer 
surfaces, AT, is given by 
QIn (1 + 2t/d) 
27 Lk , 
where « is the thermal conductivity of the cylinder. This will result in thermal expansion 
of the cavity and hence the cavity will detune (change its resonant frequency). For a single- 
cell cavity, the detuning can be corrected by varying the water, and hence the cavity, 
temperature or by using a tuner which can change the cavity frequency by perturbing the 
fields via a moving part or surface deformation. However, for multicell cavities it is likely 
that all cells will deform differently causing each cell to have a different frequency, and hence 
causing a cell-to-cell amplitude and phase variation. 

For short-pulse RF systems, where the pulse duration, tpulse, is short compared to the 
time it takes for heat to diffuse into the cavity walls, the temperature rise can be much 
sharper [27]. When the RF pulse first starts causing ohmic heating, the heat is deposited 
entirely in the skin depth and hence a very small volume. As the volume heated is small, 
the temperature rise is large, but will decrease over a few microseconds as the heat diffuses 
into the bulk. The maximum power density deposited in the wall, Py, is given by 


AT = (3.93) 


Reur H2 
Py = mae (3.94) 


The temperature rise is given by 


2Pay t ulse 


where p is the density, « is the thermal conductivity and Ce is the specific heat of the wall 
material. As the surface is at an elevated temperature compared to the bulk, the thermal 
expansion of the surface layer will be constrained creating a high stress on the surface. The 
yield strength of copper is exceeded for temperature rises of around 50 K. As the stress 
is cyclic, surface cracking can occur due to fatigue, which in turn can cause increased RF 
losses, hence surface heating, and/or field emission. 


Acceleration 65 


C1 L1 Rs C1 L1 Rs c1 L1 Rs 


ce — Cc —— Cc =l Ce _l- 


FIGURE 3.23 Circuit diagram of a three-cell RF structure. 


3.7 Maulti-cell Cavities 


For practical accelerators, what is important is not the gradient but the real-estate gradient, 
which is the accelerating voltage divided by linac length including all ancillaries and drift 
tubes. Rather than having individual cavities, each with their own power couplers, vacuum 
pumps, and water cooling, it is preferable to have a series of RF resonators, which we will 
refer to as cells and for a multi-cell structure the term cavity refers to the entire structure of 
cells, coupled together such that a single power coupler is needed for each group of cells and 
the number of cavities/cells that can be fitted per metre of accelerator is increased. This 
increases the real-estate gradient as the spacing between cells is very short. Typically, for 
high-energy accelerators, a cavity is made of a circular waveguide with each cell separated 
by a metal disk with a hole for the beam to pass through. This hole, referred to as the 
beam aperture, can also be used to provide coupling between the cells. If we add more 
cells to a cavity but keep the field in each cell the same, both the voltage and dissipated 
power increase proportional to the number of cells hence the shunt impedance of a multi-cell 
cavity, Rs cavity is the impedance of a single cell, Rs ceu multiplied by the number of cells, 
N, cells 

Rs cavity = Rs, cett Neils: (3.96) 


In order to understand the behaviour of a chain of coupled RF cells it is convenient to 
analyse the equivalent circuit. Each cell can be represented as a resonant series RLC circuit 
as above but with the addition of an additional capacitive or inductive coupling between 
cells, as shown in Fig 3.23 for a three-cell cavity with capacitive coupling, Ce. The coupling 
can be via the electric or magnetic fields. If we chose to couple through the beam aperture 
in a TM110 mode, then the coupling will be via the electric field, represented by a parallel 
capacitance shared between the cell and its neighbour. Another possibility is to have holes 
in the walls between cells near the equator where the magnetic field is strongest, which is 
represented by a parallel inductance again shared by both cells. 

The effect of this coupling on the frequency of the cavity modes can be found by solving 
the eigenmodes of the circuit. Applying Kirchoff’s loop law to cell n, we obtain the following 
equation for the current in cell n, In, 


je eee nie een nee ae (3.97) 


where Z is the impedance of the RLC circuit of the cell and Z, is the impedance of the 
coupling capacitor or inductor. The solution to this equation is 


cos Qa = 1 + (3.98) 


2Ze 


where ¢, is the phase difference between the voltage in cell n and cell n + 1, known 
as the phase advance. If we solve for a cavity with a finite number of cells we find there 
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FIGURE 3.24 Field patterns for modes with different phase advances (dg = NT / 9), in a 9-cell cavity, 


where each bar represents the amplitude of the voltage in each cell. 


are a number of possible eigenmodes of the system, where the number of eigenmodes for 
each cavity is equal to the number of cells. The variation in the field amplitude in each cell 
for a range of standing-wave phase advances is shown in Fig 3.24. The set of eigenmodes 
of the multi-cell cavity for each eigenmode of a single-cell cavity is known as that mode’s 
passband. The frequencies of the eigenmodes for N identical cells are not all at the same 
frequency, as the different currents flowing through the coupling reactance for each mode 
provides a separation in frequency. For example, in the mode with zero phase advance, the 
currents flowing in each cell cancel in the coupling reactance, while in a mode with a 7 
phase advance the currents sum together. If we expand Z and Z, in terms of capacitance 
and inductance for an N cell cavity, assuming a coupling capacitance, and define a coupling 
factor k = C/C. we can find the frequency of each eigenmode as 


w? = wy (1 H 2k E cos (E), (3.99) 


where n is the eigenmode number (an integer from 1 to N) of each mode with phase 
advance na/N. The frequency versus phase advance for a multi-cell cavity with coupling 
factor, k = 0.3 and fr/2 = 1 is shown in Fig 3.25. Having a larger coupling factor, hence a 
larger coupling capacitance or inductance, provides a larger separation between the modes 
in the passband. A larger separation reduces the coupling to more than one mode at a given 
operating frequency, and hence the perturbation of the cavity fields from those other modes. 

It is necessary to ensure that the beam arrives at each cell at the same phase, hence 
the length of each cell, Leen can be calculated so that the phase change during the transit 
time is equal to the phase advance, based on the beam velocity, the cavity frequency and 
the phase advance of the operating mode. 


Leet = Poac (3.100) 
W 


It is possible to design an accelerating structure to operate in a cavity at any phase advance, 
but in order to maintain synchronism the cells must get shorter as the phase advance 
decreases. In a standing-wave cavity, any phase advance that isn’t a multiple of 7 (radians) 
will result in empty or partially-filled cells, reducing the gradient. The transit-time factor 
is also affected by the change in cell length as the phase advance is varied, resulting in a 
reduced gradient as the cells get longer with a transit-time factor of zero for a phase advance 
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FIGURE 3.25 Resonant frequency of a multicell RF structure as a function of the phase advance between 
cells for electric coupling, where the 77/2 mode is at 1 GHz. 


of 27. This means the ideal phase advance for maximum shunt impedance in a standing 
wave cavity is 7, hence these structures are known as 7-mode structures. If a mode with a 
27 phase advance is to be used then the beam must be shielded from the RF fields for at 
least half of the RF cycle, using drift tubes as mentioned earlier. This reduces their shunt 
impedance but for low particle velocities the cell lengths for other phase advances become 
too short to make a practical structure. 


3.7.1 Standing-Wave Cavities 


All the cells in a 7-mode standing-wave cavity should have the same field amplitude, however 
if one cell has a resonant frequency different from the other cells, that cell will have a 
different amplitude. The resulting eigenmode will be a hybrid of the required mode and its 
nearest neighbours. No physical structure can be made to infinite precision so in practice, all 
cavities will have finite variations in each cell’s frequency due to manufacturing or alignment 
tolerances. In addition the phase or amplitude of each cell may vary due to the finite 
resistance of each cell, depending on the phase advance. 

The spacing between the eigenmodes in the cavity varies sinusoidally with phase advance 
with minimum spacing at 0, 7, and 27 modes, hence these modes are the most affected 
by manufacturing tolerances. In contrast the 7/2 mode has the largest modal separation, 
and the modes are symmetric around it, meaning that this mode is the least affected by 
mechanical tolerances. The mode separation is also proportional to the cell-to-cell coupling, 
hence more coupling is preferred. As the number of eigenmodes in a cavity is proportional 
to the number of cells, longer cavities have larger variations in amplitude and phase over 
the structure, hence the cell-to-cell coupling is often increased for longer structures. 

The cell-to-cell coupling can be increased by using larger apertures in the disks either at 
the beam hole for electric coupling, or at the equator for magnetic coupling. Increasing the 
aperture however increases the peak electric and magnetic fields on that aperture, which 
will reduce the shunt impedance and increase peak fields, as can be seen in Fig 3.26. For 
higher-frequency cavities, there are more cells per unit length and hence more coupling is 
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FIGURE 3.26 Peak field versus coupling constant for different aperture coupling [28]. 


required for a given structure length. 

For high-gradient applications, 7 modes are preferred as they have a higher shunt 
impedance; however, they are the most sensitive to mechanical tolerances due to the 
smaller mode separation and the fact that the other modes are not symmetric in frequency 
around the m mode, requiring higher-cost precision machining or shorter structures. For 
long standing-wave cavities, or for industrial, medical or security accelerators where larger 
tolerances are required to keep costs low, a 7/2 mode cavity is often preferred. The 1/2 
mode, however, has every second cell unfilled, reducing the cavity shunt impedance by a 
factor of two if all cells are identical. In order to restore the shunt impedance, the unfilled 
cells can be modified so that their gap is made small to the beam. The two most common 
approaches to achieving this are the side-coupled cavity where the unfilled cells are placed 
offset from the beam axis, such that the filled cells become the same length as a 7 mode 
structure, hence restoring the shunt impedance, or bi-periodic structures [28] where the un- 
filled cells are made very short and the filled cells are lengthened to maintain synchronism, 
as shown in Fig 3.27. The side-coupled cell designed for the PROBE project [29] is shown 
in Fig 3.28, where the side-coupled cells have a small capacitive gap to reduce the cavity 
frequency for a given radius, allowing compact side-coupled cells. It can be seen that nose 
cones are added around the beam aperture in each cell. As discussed earlier the nose cones 
allow smaller gaps, and hence transit time factors, without losing synchronism with the 
beam or increasing the capacitance of the cells. This allows the cavities to have a higher 
shunt impedance. 

It is critical that the accelerating and coupling cells have the same resonant frequency in 
order for the coupling cells to provide a resonant coupling. If there is a frequency difference 
between the two cell types, the coupling cells will be capacitive or inductive instead limiting 
the coupling between any two adjacent accelerating cells, providing two separate passbands 
with fields either entirely in the accelerating or coupling cells, and the 7/2 modes becoming 
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FIGURE 3.27 Geometries of 7 and 7/2 modes in a cavity with all cells the same, as well as 7/2 modes 
in side-coupled and bi-periodic cavities. 


FIGURE 3.28 The side-coupled linac for the PROBE project [29]. 
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FIGURE 3.29 Dispersion curve for side-coupled linacs, with and without confluence. 


m modes losing all the advantages described above. When the frequencies of the two cell 
types are brought together, known as confluence, all cells are resonantly coupled. In this 
case the mode with fields only in the accelerating cells becomes a 7/2 mode, with a second 
m/2 mode with fields only in the coupling cells, and all other modes having fields in both 
types of cells. The Brillouin diagram (a plot of frequency against phase advance) for a 3 
GHz side-coupled cavity for the case of confluence, and the case of the side-coupled cells 
being off frequency are shown in Fig 3.29. 

For manufacturing errors where the coupling cells are accidentally all at a different 
frequency from the accelerating cells, the amplitude, Asn, in the accelerating cell 2n, with 
the coupler in cell 2m, is given by [30] 


PAE palm? =n?) a) 
Aon = (-1)""" Aam È = | pea. TAa (3.101) 
where Qa is the Q of the accelerating cells, Qe is the Q of the coupling cells, and Aw is 
a single cell frequency shift due to mechanical errors such that the accelerating cells have 
a different frequency from the coupling cells. Typically, a coupling factor of 1% to 5% is 
chosen to ensure the fields are not significantly perturbed by machining tolerances, with 
a coupling factor of 2.1 % used in the PROBE structure. As the field deviation from a 
perfect cavity increases along the length of a structure, larger coupling factors are required 
for longer cavities, which provide an ultimate limit in number of cells of around 20-30 cells. 


3.7.2 Travelling-Wave Structures 


As mentioned previously, phase advances that are not integer multiples of 180° result in 
partially-filled or unfilled cells for standing-wave cavities, as the fields from the forward 
and backward waves destructively interfere in some cells and constructively in others. This 
destructive interference can be avoided by using a travelling-wave instead, where the power 
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FIGURE 3.30 A sectioned CLIC accelerating structure operating in the 27 / 3 travelling mode; image 
courtesy of CERN. 


only flows in a single direction and is absorbed in a load at the other end preventing 
reflections. The power is fed into the travelling-wave structure via an input coupler and any 
remaining power is removed at the other end via an output coupler. To avoid standing waves 
forming inside the travelling wave structure due to reflections at the couplers, each must be 
carefully matched individually to the structure so that there are no reflections inside the 
structure. A true travelling wave in vacuum would have a high group velocity requiring too 
high a power flow to be practical, and the phase velocity would be greater than the speed of 
light making synchronisation with a particle beam impossible. To avoid this, the waveguide 
must be ‘loaded’ to slow the wave down in both group and phase velocity. Whilst this can 
be done with a uniform dielectric loaded waveguide [31], it is more common to load the 
waveguide with aperture coupled disks, known as a disk-loaded waveguide [32]. A cutaway 
of a disk-loaded travelling-wave structure for CLIC is shown in Fig 3.30 [16]. 

As the disks are periodic, the wave will be reflected at each disk but will cancel every 
couple of cells due to the periodicity. As such they are not true travelling waves as each cell 
will have a longitudinal field variation, making it closer to a chain of standing-wave cells 
with a phase advance between them. However, the magnitude of the electric field will be 
identical in each cell and the structure will have a net power flow in one direction unlike a 
standing-wave structure. Using Floquet’s theorem the field in each cell, Eez, is identical in 
each cell other than a phase shift, as shown in Fig 3.31 for a 27/3 phase advance, and can 
hence be described using the field profile in a single cell E, and the phase advance, a as 
[33] 

Ecz = E,(z)[exp(—i¢a) + Texp(ida)], (3.102) 


where I is the reflected wave from the coupler, which is ideally zero for a matched structure. 
The travelling-wave structure for the AWAKE booster [34], is shown in Fig 3.32 showing 
the cell amplitude at a fixed point in time repeats every three cells, and is hence a 27/3 
structure. The amplitude in each cell is constant but there is a phase difference between 
cells, so at any given point in time, the voltage in each cell will be different. The cell length 
and the phase advance is chosen such that the beam is always in the cell with the highest 
voltage. The length of each cell should satisfy 


Leet = 


Peha (3.103) 
w 

in order for the beam and wave to be synchronous. For low-energy electrons where 6 varies 

in each cell, the cell length should be varied rather than the phase advance in order to 

minimise reflections. Given this, we can evaluate the phase advance and internal reflections 

inside a structure given the field in each cell. If we take the sum, X, and difference, A, of 
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FIGURE 3.31 The real and imaginary components of the longitudinal electric field in a 277 / 3 travelling- 
wave structure. 
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FIGURE 3.32 The travelling-wave structure for the AWAKE booster. 


the fields in the cells either side of a given cell 


ye (E.(z Lett) t E,(z Leet) ) 
E,(z) i 
(E.(z Lett) E, (z Leet) ) 
A= 3.104 
E) (3.104) 
then the phase advance can be found using 
x 
cos(a) = (3.105) 
and the reflected signal can be found from 
2sin(ġa)— iA 
a ee =? (3.106) 


~ 2Qsin(@a) + iA’ 


It should be noted that the internal reflection I is not the same as S11, because if the 
input and output couplers are identical, the reflection from each coupler will cancel at the 
input giving S11 = 0 despite there being a reflected wave inside the cavity between the two 
couplers. For a matched travelling-wave structure, we hence require S11 =T = 0. 

Each cell has a power flow into the cell, Pu, power loss in that cell due to ohmic losses 
or beam loading, and a power flow out of that cell. If the power flow is much larger than 
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the other losses then the structure has a wider bandwidth, and the cavity behaves like a 
travelling wave with the filling time of each cell being short compared to the time for the 
power to flow through the structure. It is this increased bandwidth that makes travelling- 
wave structures insensitive to imperfections allowing longer structures to be used. They 
can be at least four times longer than a standing-wave structure. It is possible to load 
a short structure so that the group velocity, and hence the power flow, is much lower to 
increase efficiency. In such cases the individual cells fill slower, and reflections may occur 
during filling like a hybrid between a travelling- and standing-wave structure [35]. Due to 
ohmic losses the power flow decreases along the structure. The lower the group velocity, the 
higher the ohmic losses in the cell and hence the power flow will decrease faster along the 
length of the structure. If the structure is too long, the power will be too low in the end of 
the structure to achieve any usable gradient hence each structure has a maximum realistic 
length dependent on group velocity. If the structure is too short, then the power flow at 
the end of the structure will be large and will be absorbed in an RF load. However, having 
a lower group velocity also increases the stored energy per cell, and hence the gradient. 
Therefore for a given structure length, the group velocity should be chosen to maximise the 
average gradient. If the group velocity is chosen to be constant along the length, then the 
structure is said to be constant impedance. 

We can calculate the accelerating voltage for a travelling-wave structure starting with 
the relation between power flow and group velocity in a cell, 


dU 
P= Ug = .1 
“GT (3.107) 
The resistive power loss per unit length, P! is given as 
dP, 
Paea, .1 
= -7 (3.108) 


Considering the definition of Q factor and applying the power flow equation above we get 
dPy  wPy 
dz Qu, 


As the fields will decay exponentially along the structure, we can define an attenuation 
parameter, œo, as 


(3.109) 


Ww 


— . 
a v ; (3 1 0) 
dz = : 


Considering the definition of shunt impedance per unit length and applying the power flow 
equation above we get 


wP,, E? 
J = 2aP, = <. (3.112) 
Ug Te 


This can be rearranged to give the accelerating gradient in the a given cell cell, Facce, 


Face =V 2rsaPy. (3.113) 


Hence the power flow decays along the structure according to 


Pw (z) = Po exp (—2az). (3.114) 
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Where Pp is the power fed into the first cell via the input coupler. The exponent of the total 
decay along the structure, To, is given by 


wL 
=aL= : 3.115 
=ne (3.115) 
As the power flow decays, so does the accelerating field 
dEace 
= —Q Facce, (3.116) 
dz 


and hence the accelerating voltage is reduced, and can be found by using this equation to 
find the accelerating gradient along the structure length and integrating leading to 


are") 


To 


Vace = Eo cos Q. (3.117) 
where Fo is the field in the first cell. Hence, taking the equation for the accelerating gradient 
in the first cell into account we obtain 


1—e-7 
Vace = V ar PE cos Q. (3.118) 


In the constant-impedance structure described above, all cells are identical. The gradient 
can be improved for a given input power and structure length by tapering the group velocity 
along the structure. For reasons of making power dissipation and breakdown uniform the 
optimum case is for the group velocity to be tapered such that the gradient is constant in 
each cell [36]; hence these are known as constant-gradient structures. Each subsequent cell 
will have a slightly lower group velocity than the one before it to account for RF losses in 
the previous cell. For a given input power, the group velocity in the first cell, and hence 
all subsequent cells, is chosen to be as low as possible to achieve the maximum gradient in 
each cell whilst still allowing a small amount of power to reach the final cell. The group 
velocity in each following cell has to be matched to the cell before it such that the gradient 
is the same. If the power flow is too low the structure will experience reflected power, lower 
bandwidths and more sensitivity to tolerances. 

In both cases the group velocity is normally varied by changing the aperture radius but 
can also be varied by altering coupling slots placed in the disk near the cavity walls [35]. 
When increasing the aperture or coupling slot radii, the two adjacent cells are coupled via 
either electric or magnetic fields respectively, and the increased surface currents around the 
opening lead to higher peak electric and magnetic fields. Higher peak electric fields may 
result in breakdown and higher peak magnetic fields will increase the ohmic losses and 
hence decrease the shunt impedance. The input and output couplers must be matched to 
the structure so that the impedance of the coupler appears the same as an infinite structure, 
therefore ensuring there are no internal reflections. There will be a small reflection as the 
adjacent cell fills but this will be small for most structures due to the high power flow along 
the structure. The coupler should be designed to have an external Q factor, Qe, given by 


Qe = = (3.119) 


Ug 


For low-energy electron linacs, such as those used in radiotherapy, the electron velocity 
will change along the structure’s length. In such cases the distance between the disks can 
be varied such that the phase velocity can be changed without changing the phase advance 
of the linac, and hence retaining the travelling wave without internal reflections. 
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3.8 Wakefields 


When a relativistic electron beam travels down a conducting beam pipe, it generates an 
image current which travels with the beam. When the beam reaches a change in the cross- 
section of the beampipe, such as a cavity, the image current must take a longer conduction 
path hence it will slip behind the bunch. There will be a decelerating or deflecting force on 
the beam as it moves away from the image charge/current, and the beam will lose energy 
to the cavities’ fields driven by these surface currents and charges. When the beam leaves 
the cavity. the energy transferred to the cavity modes will remain, and can interact with 
later bunches, or later particles within the same bunch. This effect is known as a wakefield, 
as it is the field left in the wake of the bunch, and can cause serious disruption in the beam. 

Wakefields are discussed in Chapter 7 so here we limit ourselves to the RF effects. 
Wakefields will radiate over a wide range of frequencies, including the operating frequency 
of the cavity. This will cause a change in the cavity’s operating modes fields known as beam- 
loading. Any radiation into other resonant modes of the cavity must be damped to avoid 
them growing to levels where they cause the beam quality to be degraded, which we will 
discuss later in this chapter. 

If a cavity is driven by an external RF source then the wakefield will be superimposed 
on the driven fields in the cavity. If the beam current is synchronised with the peak of the 
RF voltage, known as on-crest, then only the amplitude of the operating mode will change, 
while if the beam is off-crest, there will be both a phase and amplitude change. The change 
in field will also change the matching conditions for the cavity, causing reflections at the 
coupler and hence requiring a change in external Q to re-match the cavity. 


3.8.1 On-Crest Beam-Loading 


For the case where the beam is on-crest, cavity behaviour can be described to the first order 
with some minor modifications to the equations without beam. In this case the beam-loading 
can be modelled as purely resistive, although it could have a negative resistance for the case 
of decelerating. The power transferred from the cavity to the beam in the cavity, ignoring 
the change in the cavity voltage due to the wake within a single bunch, is approximately 
given by 

P, = Vaecl (3.120) 


where Vace is the accelerating voltage and J, is the beam current, which must be replaced 
by the RF source, along with the power to replace ohmic losses in the walls, to maintain the 
cavity voltage. Looking at the cavity and beam from the RF source, cavity ohmic losses and 
on-crest beam-loading is indistinguishable, and hence we can define a new coupling factor 


Pe 
= .121 
Bo P. +P, (3.121) 
and hence the reflected power can be given as 
U 1-6 i 
w — Pb 
P, = —— Pr ——— 3.122 
Qe i ( 1+ z) ( ) 
and the stored energy becomes 
4P;6? Qe 
Uy = I Qe (3.123) 
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3.8.2 Off-Crest Beam Loading 


If the beam current is not in phase with the RF voltage, then the beam-loading gains a 
reactive component, either capacitive or inductive depending on the side of the crest on 
which the beam arrives. As such, the beam-loading will change the phase of the RF as well 
as the amplitude. Additional RF power will be required due to the reactance, as it will cause 
reflections at the input coupler. A similar effect will occur if the generator frequency and 
the cavity frequency are different, with the cavity presenting a reactance to the generator. 
It is useful to define a detuning angle, Y, given by 


tan yb = 20, (3.124) 


Considering the power transferred to the beam as well as the reflections due to the 
change in reactance, or generator detuning, the required RF power, P}, to keep the voltage 
constant is [30] 


2 a 2 
P, = P. (1 + Bye i |(c $. 4 Ker) te (sins, aa ine) : (3.125) 


46 cos? Vace Vace 


where @, is the phase shift between the cavity voltage and the beam current, P, is the power 
required without the beam and V, is the beam-induced voltage in the cavity given by 


ae Tyrs cosw 
1+ £6 


The additional required power can be corrected by tuning the cavity to a different resonant 
frequency to cancel out the beam’s reactance. In this case the cavity should be detuned by 
w IR, sin ds 


A 
tan = -2 = -7 + By (3.127) 


(3.126) 


3.9 Superconducting RF 


In 1911 Kammerlingh Onnes discovered that the resistance of some materials disappeared 
when cooled by liquid helium to a temperature of 4.2 K but it wasn’t until 1956 that this 
effect was explained by Bardeen, Cooper and Schrieffer in what is known as BCS theory 
(after their initials), which won them a Nobel prize in 1972. Electrons are fermions and 
thus obey Fermi statistics; this means that the Pauli exclusion principle holds and only 
two electrons with opposite spins can occupy each energy level. However, in some materials 
there is a transition temperature, Te, below which electrons with opposite spins experience 
an attraction via lattice vibrations and become weakly bound. The transition temperature 
for niobium, the most common SRF material, is 9.3 K. The bound electrons are known 
as Cooper pairs, which obey Bose-Einstein statistics; hence the Pauli exclusion principle 
no longer applies and the Cooper pairs can occupy the same energy state. The Cooper 
pairs all flow as one with the same velocity and the same direction and are not scattered 
by impurities, hence the material has zero resistance to DC currents. A key aspect of a 
superconductor (as opposed to an ordinary very good conductor such as gold) is that when 
cooled below its transition temperature, all magnetic fields will be expelled. This is known 
as the Meissner effect, and is caused by supercurrents flowing with no resistance to shield 
the magnetic field, which is energetically more favourable than allowing the field to enter 
inside the superconductor. 
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When cooled to absolute zero, all the electrons are bound in Cooper pairs, while at 
temperatures between zero and the transition temperature, some of the electrons remain 
unpaired and behave like normal electrons [18]. This can be considered as two conducting 
fluids in parallel, one with the normal conducting conductivity and the other with the 
superconducting state of conductivity. In a DC case, as the superconducting conductivity is 
so much higher, almost all the current is carried by the Cooper pairs which can flow without 
resistance. However, when an RF field is applied to a superconductor, the resistance is not 
zero, although it is very small. While the Cooper pairs move without friction they do have 
mass and inertia. Because of the inertia, the Cooper pairs do not screen applied time- 
varying fields perfectly as there is a delay between the current reversing direction and when 
the electric field is reversed. A time-varying electric field penetrates a small distance into 
the surface due to induction from the time-varying magnetic field inside the surface. This 
causes a small power dissipation as the fields at this depth can cause the normal electrons 
to carry some of the current. London derived two equations to describe the behaviour of a 
perfect conductor. London realised that the condition for the magnetic field expulsion is 


nse? 


V x js + ——B =0, (3.128) 


m 
where n, is the number density of Cooper pairs and js is the current density induced in the 
superconductor’s surface by an electric field E. This is known as the 2nd London equation. 
Using the London and Maxwell’s equations we can show that the field will penetrate a 
short distance into a superconductor, known as the London penetration depth, Az, where 
the magnetic field parallel to the surface will decay though the superconductor as 


x 

H, = H, exp (-=) ; (3.129) 
ÀL 

where x is the distance from the surface. The London penetration depth for niobium, the 
most common superconducting (SRF) material, is 36 nm. London also postulated that the 
rate of change of the current in time was proportional to the applied electric field; 

ðjs nse? 

ðt m 


(3.130) 


known as the 1st London equation. This means that when an RF field is applied, the current 
will be out of phase with the voltage, and hence the surface impedance has both resistance 
and reactance. The surface reactance, X, is given by 


Xs = wHoàL. (3.131) 


The dissipated power can be given in terms of a surface resistance, which is much smaller 
than the reactance. The resistance of a superconductor at RF frequencies was derived from 
Bardeen, Cooper and Schrieffer and is hence known as the BCS resistance, Rgcs : 


w? A 
R = A— —— 3.132 
Bos = Az exp ( =r) ; (3.132) 
where A is the band gap of the superconductor, T is the temperature, and A is a material 
dependant constant. It is generally found that the surface resistance is proportional to the 
conductivity of the normal conducting state. From this equation it can be seen that the 
BCS surface resistance has the following dependence: 


e Recs increases x f*, shown in Fig 3.33; 
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FIGURE 3.33 BCS surface resistance as a function of frequency for a niobium cavity at a temperature 
of 1.8 K. 
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FIGURE 3.34 BCS surface resistance as a function of temperature for a niobium cavity at a frequency 
of 1.3 GHz. 
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e Recs increases exponentially with temperature, shown in Fig 3.34. 

Superconducting RF (SRF) cavities have higher losses as they increase in frequency; 
for this reason there are few SRF cavities in accelerators with operating frequencies above 
4 GHz. As SRF cavities have a very high ohmic Q factor, they also have a much larger 
shunt impedance (which relates voltage to power dissipated in the cavity) than a normal- 
conducting (NCRF) cavity. This means we can avoid nose cones and can have larger irises. 
As SRF cavities often operate at lower frequencies and have larger irises — and hence have 
lower geometric shunt impedance — SRF cavities have much lower wakefields. However in 
SRF cavities, any energy induced remains undamped in the cavity for long periods of time 
without special additional couplers. The most common SRF material is niobium (Nb), which 
has BCS resistance [18] 


Racs|Q] = 2 x 0s (25) exp (FE) : (3.133) 


More recently studies have shown that doping or infusing Nitrogen into Niobium, by 
annealing in a partial pressure of Nitrogen followed by an electropolish, can provide a lower 
surface resistance that the predicted limit for bulk Niobium [37] 


3.9.1 Residual Resistance 


In addition to the BCS resistance there can be additional losses due to impurities or surface 
layers known as the residual resistance, Rres. The total resistance is given by 


Reotal = Recs + Rres: (3.134) 


Typically, a clean cavity operating at a frequency of around 400 MHz will have a residual 
resistance between 1 and 10 nQ and the operating temperature is chosen such that the BCS 
resistance is less than the residual resistance. This means that low-frequency cavities (below 
500 MHz) will usually operate at 4.2 K, which is the boiling point of He at atmospheric 
pressure. At higher frequencies (such as 1.3 GHz) the operating temperature is reduced to 
~2 K. As SRF cavities typically operate with resistances between 1 and 10 nQ, this gives 
Q factors over 10° for elliptical cavities. The residual resistance is thought to increase with 
cavity frequency as well [38]. 

A major cause of residual resistance is flux pinning. If we look at what happens as we 
apply an external magnetic field to a normal conductor and cool it to a perfectly conducting 
state we can see that the flux lines become ‘frozen in’ where the conductor becomes magne- 
tised. This causes a problem if a cavity with normal conducting impurities is cooled in the 
presence of an external magnetic field, Hac, (such as the earth’s magnetic field). Supercur- 
rents flow around these trapped magnetic fields. When an RF magnetic field is applied to 
the cavity, the field lines will oscillate which leads to an increased surface resistance. The 
additional resistance is given by 


Rres = am Hac y f|GH2], (3.135) 


where @,, is around 0.2-0.3 nOQ/mG for Niobium. For this reason SRF cavities are normally 
shielded from all external magnetic fields. Cavities with a thin-film Niobium coating tend to 
be less sensitive to external magnetic fields. Residual resistance can also be caused by ohmic 
losses or dielectric losses in the impurity itself, such as a copper inclusion, where the gener- 
ated heat is transferred to the superconductor, raising its temperature and hence the local 
BCS resistance. Recent studies have suggested that the creation of thermo-electric currents 
due to large temperature gradients when cooling the cavity down can cause magnetic fields 
which increase the residual resistance [39]. 
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3.9.2 SRE Field Limitations 


Due to the sharp increase in RF surface losses with temperature, the superconducting state 
is very delicate at high field. Heating caused by RF or electron phenomena can lead to a ther- 
mal runaway, although it should be noted that the thermal conductivity is also temperature 
dependant. There is also the possibility of phase transitions between the superconducting 
and normal conducting states at high field. SRF cavities are limited by these effects to just 
over 50 MV/m depending on the ratio of surface fields to gradient, however ~30 MV/m is 
currently about the maximum gradient achievable repeatedly for accelerator applications. 
The European XFEL chose a design gradient of 23.6 MV/m to ensure a high manufac- 
turing yield, i.e. so that most cavities either achieve or exceed the design gradient [40]; 
the International Linear Collider (ILC) has a global R&D program to demonstrate reliable 
performance at 31.5 MV/m [41]. 


Critical Magnetic Field 


The superconducting state is more ordered than the normal conducting state, hence it 
has less Gibbs free energy, which is the maximum amount of reversible work that can 
be performed by a system at constant temperature. When an external magnetic field is 
applied, supercurrents flow in the penetration depth to cancel out the fields in the interior. 
This causes the Gibbs free energy to rise in the superconductor quadratically with field for 
the superconducting state. When the field is increased to a level where the free energy of 
the superconductor is equal to the free energy of the normal state the two phases are in 
equilibrium. This occurs at the thermodynamic critical magnetic field, He, above which all 
the flux enters the superconductor (although as we will see below, this is modified by surface 
energy barriers). At this point the cavity is no longer superconducting and the cavity is said 
to have quenched. The critical field varies with temperature as 


H,(T) = He(0) i . ei 


The transition temperature T, is the temperature where the superconductor changes be- 
tween the normal and superconducting state. 

There is also a surface energy barrier at the interface between the superconductor and 
a normal conducting region. This surface energy can be positive or negative. Type-I super- 
conductors have a positive surface energy and all fields will enter the superconductor at 
He. In Type-II superconductors — such as niobium — the negative surface energy makes it 
energetically favourable for a normal conducting fluxoid to enter the superconductor at a 
lower magnetic field, known as He creating small normal conducting flux tubes inside the 
superconductor which mostly remains superconducting. As the external magnetic field is 
increased, more fluxoids penetrate the superconductor in an ordered lattice. The fluxoids 
have a finite size given by the coherence length, which is the length scale of changes in the 
superconducting state, so eventually there will be so many fluxoids that they will touch 
each other and all flux will enter the cavity. This happens at a field higher than He, known 
as H.2. The superconducting state can exist meta-stably in a superheated state higher than 
Ha at RF frequencies up to the RF critical field, roughly equal to He for niobium at 2 K. 
For niobium at 0 K, we have Hea =170 mT, He =200 mT, and H,2 =240 mT, and the 
coherence length is 64 nm. 


(3.136) 


Thermal Breakdown of Superconductivity 


Thermal breakdown is when a superconductor abruptly becomes normal conducting, similar 
to a quench, caused by the surface temperature exceeding Te. This is a runaway effect as 
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FIGURE 3.35 Qo plotted against Facce for two cavities (one good, one poor) from CERN’s bulk niobium 
high gradient SRF programme; image courtesy of CERN [42]. 


a superconductor’s temperature raises its surface resistance, which hence increases power 
dissipation and temperature; this heats the surrounding area which in turn has its resistance 
increase. The main cause of a thermal breakdown is the heating of normal conducting 
impurities on the cavity surface (which heats the superconductor around it), or by heating 
due to field emission where the emitted electrons impact on the cavity surface depositing 
their energy. As mentioned previously, the temperature gradient between a cooled wall (in 
this case by liquid helium) and the RF surface where heat is applied is proportional to the 
wall thickness and a thermal boundary resistance known as the Kapitza resistance, hence 
this effect can be reduced by using thinner walls. The downside of this is making the cavities 
mechanically weak and prone to deformation. As a compromise, most SRF cavity walls are 
3-4 mm thick. Another solution is to place a thin film of superconductor inside a copper 
cavity. As the copper has a high thermal conductivity, they can have thicker walls for a given 
temperature rise. However, coating a cavity with a superconductor is a developing field and 
cavity performance is still not comparable with bulk niobium cavities, but is suitable for 


low gradient applications, such as synchrotrons like the LHC. 


Field Emission 
In the presence of impurities or defects we also have an electric field limit. This is caused 
by field emission of electrons from regions of high electric field. These electrons will impact 
on the cavity surface which will locally increase the cavity temperature, leading to a higher 
surface resistance at that location. Impurities or defects can cause sharp points on the cavity 
surface, known as field emitters, which have a field enhancement on the edges causing higher 
local surface fields leading to field emission. The field emission is also usually accompanied 
by X-ray emission which can therefore be used to determine if field emission has occurred. It 
can be seen in Fig 3.35 for the case of the poor cavity that the Q factor drops off steeply in 
an SRF cavity when it starts to field emit as the heating leads to a higher surface resistance, 
while in the good cavity, field emission starts at a much higher field, likely due to a cleaner 


surface. 
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As previously mentioned, multipactor can also be a limiting factor in SRF due to the 
additional heat load. Pure niobium has a peak secondary emission yield of around 1.2, how- 
ever, any oxide layers or contamination on the surface can increase the secondary emission 
yield above 2.0. Fortunately the electron impacts caused by multipactor act to clean the 
cavity surface, hence conditioning the surface and reducing the secondary emission yield. 
Multipactor that persists after a few hours of conditioning is known as a hard multipacting 
barrier, while multipactor that dissipates after conditioning is known as a soft multipacting 
barrier. 


3.9.3 Cavity Cleaning 


In order to achieve high gradients, the cavities must be specially cleaned to remove any 
particulates which could lead to field emission or increased surface heating. They must be 
washed with ultra-pure water (in clean rooms), rinsed using high-pressure water jets and 
have the walls smoothed using acid. Two methods of surface preparation have been devel- 
oped for surface preparation of SRF Nb cavities. The first is buffered chemical polishing 
(BCP). This method uses a mixture of three acids: hydrofluoric acid, nitric acid and phos- 
phoric acid. Nitric acid reacts with Nb to form niobium pentoxide, Nb2O;. Hydrofluoric 
acid reacts with the pentoxide to form niobium fluoride, NbF', which is soluble, creating a 
polished Nb surface. The phosphoric acid serves as a buffer to help keep the reaction rate 
constant. The other surface preparation method is electro-polishing (EP). Here an acid mix- 
ture of mostly sulphuric acid with some hydrofluoric acid is used; the cavity acts as an anode 
and a cathode electrode is placed inside the cavity, with a potential difference of 10-20 V 
applied, which activates the polishing process. The enhanced electric field at any protrusion 
will cause the Nb surface to oxidise there first, thereby smoothing the surface [18]. 


3.9.4 Microphonics and Tuners 


All mechanical structures have mechanical resonances, where the transfer of mechanical 
vibrations from the source (such as a vacuum pump) to the structure is enhanced. In SRF 
these effects are called microphonics. As mentioned previously, SRF cavities have very high 
Q factors giving very small bandwidths, usually less than 1 Hz. Mechanical vibrations 
coming from ground motion, vacuum pumps and other environmental noise will cause the 
resonant frequency to vary in time by up to 1 kHz, which is three orders of magnitude 
more than the cavity’s bandwidth. When testing a cavity without beam, the LLRF system 
can rapidly vary the drive frequency to follow the cavity frequency, but in an operating 
accelerator the drive frequency is fixed. This requires the cavity bandwidth to be increased 
by decreasing the external Q of the fundamental power coupler. 

The klystron frequency, which is normally fixed to a stable reference clock, must be 
within the cavity bandwidth, hence the cavity frequency must be tuned within that range 
by squashing its shape using a mechanical tuner. As the bandwidth is so small, any small 
perturbation must be accounted for. In order to fast tune for microphonics, the mechanical 
tuner can be fitted with a piezoelectric crystal which expands or contracts depending on a 
voltage applied to it, allowing a smaller cavity bandwidth to be used. 


3.9.5 Cryogenics 


The ohmic losses may be much lower for SC cavities than normal conducting cavities by 
a factor of around 10°, however significant additional electrical power is required in the 
system to remove the heat and/or to re-condense the helium as cryogenic refrigerators are 
very inefficient. The cryogenic system requirements reduce the efficiency of superconducting 
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structures although they are still more efficient overall than normal conducting structures. 

All refrigerators have a technical efficiency, nr, of 20-30 %. In addition, we are limited 

by the Carnot efficiency, ne, which is the maximum theoretical efficiency any heat engine 

working between a hot, Thot, and cold, Teoia, temperature reservoir can operate at, given 

by 

PS7 Teold 
Thot = Teold 


The dynamic heat load, P., is the RF power dissipated in the cavity walls by the RF fields. 
A static heat load, P,, adds additional heating (the static heat load is the power dissipated 
with no RF in the cavity due to supports and other connections). 

Liquid helium transfer lines are another static heat load that typically requires ~0.1 W 
per metre of cooling (although some flexible connections may have higher losses), so total 
loss is length L multiplied by loss per metre, i.e. 0.1 L. It is standard to fill to an overcapacity, 
O, in case extra cooling is required. Hence we can calculate the total electrical power needed 
for cooling each cryostat, Poryo, as 


Ne (3.137) 


B — OPe+ Py + 0-12) aT 
cryo NTN $ ` 


Apart from the power required to extract the heat, SRF cavities have very few problems 
operating with long pulses at their maximum gradient; hence SRF cavities are currently 
favoured for CW (continuous) applications. 


3.10 RF Couplers 


The RF is coupled into the cavity from the waveguide via a fundamental power coupler 
(FPC). The interface between the cavity and the coupler can couple via the electric fields, 
magnetic fields or both. The coupler can come in rectangular waveguide or coaxial config- 
urations. In the case of waveguide couplers, the field in the waveguide mode (normally the 
TE 9 mode) should be matched to the fields in the cavity, with electric and/or magnetic 
fields aligned in the same direction on either side of the interface. The coupling between 
the electric fields can be found by matching the cavity field at the interface Ecay to an 
expansion in terms of the modes inside the coupler 


Ecav = 5 an En, coup; (3.139) 
n=1 


where Ey coup is the electric field of the nth waveguide mode at the same interface and an is 
the amplitude of that waveguide mode. Similarly, the magnetic field at the cavity interface, 
Beav, is expanded as 

Beav = 5 bnBn,coup (3.140) 


n=1 


where Bn coup is the magnetic field of the nih waveguide mode at the interface and bpn is the 
amplitude of that waveguide mode. This equation can be solved for each waveguide mode 
to find the coupling to each mode. 

For coaxial couplers we have a choice in the geometry at the end of the coupler where 
the cavity and coupler meet, that we can optimise to ensure the cavity is critically coupled. 
If we leave the inner conductor un-terminated with no connection to the outer conductor 
(known as probe termination), as shown in Fig 3.36, then the electric field of the cavity can 
create a charge difference between the inner and outer conductor which varies with time, 
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FIGURE 3.36 The three types of termination for a coaxial coupler: probe, loop and hook. 


hence acting as a current source in parallel with the capacitance between the inner and 
outer conductor. The current, I, is given by 


dQ d frip E-dS 


a oe ae (3.141) 


where E is the electric field on the tip of the inner conductor and S is the surface area of 
the tip of the inner conductor. If we connect the inner conductor to the outer conductor 
via a loop then the magnetic field can create a voltage across the hook loop via magnetic 
induction. This has an equivalent circuit diagram of a voltage source in series with an 
inductor. The voltage is given by 


5 df, Bas 
V= = = J = (3.142) 


where ® is the magnetic flux through the loop. A magnetic loop has difficulties in assembly 
as the inner and outer conductors need to be joined. It is also possible to instead have an 
inductive hook at the end of the coaxial lines inner conductor that has a small capacitive gap 
between itself and the outer conductor, also shown in Fig 3.36. Such a termination can be 
excited by both electric and magnetic fields, however each has a slightly different equivalent 
circuit. For the hook, the inductor and capacitor are in series with each other, however for 
magnetic field coupling, this series LC circuit is also in series with the voltage source and 
for electric coupling, the series LC circuit is instead in parallel with the current source. As 
the capacitor, Cgap, and inductor, Lioop, are in series, they form a resonant circuit which 
acts as a bandstop filter for electric fields and a bandpass filter for magnetic fields, with a 


resonant frequency 
1 


E LicopC gap 
The equivalent circuit for each type of coupling is shown in Fig 3.37. The choice between 


types will depend on the chosen coupler location, the cavity fields at that location, and the 
RF heating on the coupler tip. 


wp (3.143) 


3.10.1 Fundamental Power Couplers 


The RF is fed into the cavity via a fundamental power coupler (FPC), which is designed 
to handle high power flow. By varying the geometry of the coupler, hence altering their 
capacitance and/or inductance, we can vary the external Q of the coupler, in order to 
match the RF systems. For high-frequency normal conducting cavities, the FPC is almost 
always waveguide for power handling reasons, while for low-frequency cavities (below 400 
MHz) coaxial coupling is preferred to reduce the size. The couplers can be placed in the 
cavity equator, known as on-cell couplers, or beside the cavity to couple via the beam 
pipe. SRF cavities normally prefer coaxial couplers, even at higher frequencies, to reduce 
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FIGURE 3.37 Equivalent circuits for electric and magnetic coupling. 


the heat transport through large waveguides, although for synchrotrons where high power 
is required, rectangular waveguide couplers are sometimes used as it avoids the problem 
of cooling the inner conductor. The presence of a coupling hole near the cavity equator 
enhances the magnetic field and may cause premature thermal breakdown in the case of 
superconducting RF cavities; hence, SRF couplers are normally placed in the beampipe 
away from the cavity, although some low-field SRF cavities use on-cell couplers. 

In normal-conducting cavities, couplers in the beam pipe can either be placed next to the 
cavity so that the waveguide couples via the iris, or separated from the cavity via a longer, 
larger diameter circular waveguide in the beam pipe (such that the beam pipe is not cut-off) 
known as a mode launcher. The advantage of a mode launcher is that the structure that 
couples the rectangular waveguide to the circular waveguide can be manufactured separately 
and connected to the cavity via a flange, although it takes up more space longitudinally. 
In many linacs there is a requirement to make the fields as symmetric as possible to avoid 
a transverse electric or magnetic field on the beam axis which may disrupt the beam. To 
avoid this two transversely opposing waveguide feeds are often used so that power is fed 
from both sides. 

For SRF couplers the design is complicated by the requirement to minimise the heat con- 
duction between the room-temperature interface and the liquid helium vessel. To minimise 
the thermal conduction, couplers are often made from steel with a thin coating of copper 
to minimise ohmic losses on the RF surfaces. For a given coupler length, it is inefficient 
to simply have a temperature gradient between the cold and warm parts; typically there 
are several stages held at fixed temperatures by cooling with liquid helium at the lowest 
temperature stage, then helium gas or liquid nitrogen at an intermediate stage in order to 
minimise the heat deposited at the lowest temperature. Due to the temperature gradient, 
bellows must be used to allow the coupler to thermally contract when cooling down. 

In addition, to keeping the cavity clean, the coupler will have one or two RF windows 
which are transparent to RF but which are vacuum tight. The windows will be made from a 
high-resistivity ceramic — such as alumina (aluminium oxide) or beryllia (beryllium oxide) — 
meaning that the windows have the problem of charging up if they are struck by electrons; 
hence, care is taken to avoid any line of sight from the beam to the window. However, the 
window can still be impacted by electrons due to field emission causing them to charge up. 
This leads to the possibility of multipactor, vacuum arcs, or flashover — the latter where 
electrons are attracted to the charged ceramic, which on impact produces more secondary 
electrons, leaving a net positive charge, which in turn are also attracted to the ceramic by 
the positive charge to give an avalanche. These phenomena can lead to coupler damage, and 
eventually window metallisation or detuning of the coupler. Multipactor can be avoided in 
coaxial couplers by providing a DC bias between the inner and outer conductors. Another 
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major cause of window failure is mechanical stress caused by thermal gradients along the 
window. 

Many coaxial couplers for SRF cavities operating at frequencies above 0.4 GHz will 
connect to a rectangular waveguide, and hence a special coupler known as a doorknob is 
used to transition between the coaxial line and the waveguide. All of the features of an FPC 
need matching to the RF at the resonant frequency which results in the coupler having a 
narrow bandwidth. 

Common causes of failure in superconducting cavity FPCs include: 


e Vacuum leaks/cracked window; 
e Overheating; 

e Arcing/breakdown; 

e Window metalisation; 

e Multipactor; 


e Band-pass detuning. 


3.10.2 HOM Couplers 


As was discussed earlier, a bunch of charged particles will decelerate and deposit RF energy 
into undesired modes in the cavity, in a process known as wakefields. These wakefields 
excite the higher-order modes (HOMs) of the cavity (i.e. modes of higher order than the 
fundamental TM119 mode of the cavity), which can then have unwanted effects on the beam. 
In order to reduce the effects of these wakefields it is necessary to damp (reduce the energy 
in) these modes using special couplers to remove this power. HOM couplers are designed to 
couple power from cavity HOMs out of the cavity to a resistive load. However, they must 
not take power out of the cavity at the fundamental frequency. To avoid this, the coupler 
must use a high-pass or band-stop filter. This can be implemented in two ways: 


e use a waveguide with the cut-off frequency above the fundamental frequency (high- 
pass); 

e use a band-stop filter in a coaxial line using inductive stubs (metal cylinders connecting 
the inner and outer conductor) on the inner conductor with a small capacitive gap in 
the stub. 


Waveguide couplers are often larger than coaxial couplers but can handle higher powers. 
They are very simple, but their size can often be a problem in SRF applications as it 
provides a thermally-conducting path between the cold and hot parts of the cryomodule. 
Waveguide couplers often have stronger coupling to the HOMs, and can handle higher HOM 
powers with less RF loss, are simpler to cool, and have less chance of electron activity such 
as multipactor and are hence favoured for high-current applications. 

Coaxial HOM couplers are very complicated and include many inductive stubs and 
capacitive gaps in order to minimise coupling at the fundamental frequency and maximise 
coupling at the most problematic HOMs. If the inner conductor is large enough, it may be 
possible to have water or helium flow inside of it for cooling. The complicated geometry can 
also cause problems with multipactor or arcing. 

If the beam current is high enough where wakefields are an issue in normal conducting 
cavities waveguide couplers are mounted on every cell, with an RF load composed of a Silicon 
Carbide (SiC) wedge installed in each waveguide. For very high-current applications, RF 
absorbers can be placed in the cavity beam pipe allowing frequencies above the waveguide 
cut-off to be strongly damped. Modes with frequencies less than the beam pipe cut-off will 
decay exponentially in the beam pipe, with the decay sharper at lower frequencies, hence 


Acceleration 87 


at frequencies close to the cut-off the mode can still be damped if the fields haven’t decayed 
before the absorbers. 


3.10.3 Coaxial HOM Couplers 


The complex pass-band structure of coaxial couplers are often modelled using equivalent 
circuits. Like FPCs they can have capacitive or inductive coupling. The reactive coupling 
element will reduce the power deposited in the load, but at a single frequency the reactive 
element can be compensated for by using another reactive element with the opposite sign. 
Capacitive coupling can be compensated with a parallel inductor, taking the form of a 
stub, and an inductive coupler can be compensated with a series capacitance (a gap). The 
compensation frequency, Weomp, for capacitive coupling (with capacitance C) compensated 


with a stub is given as 
1 


Weomp = L.C 
c 


where Le is the inductance of the compensating stub. The reactance of an element, of 
impedance Zs, can be varied by using a transmission line, of impedance Ze and length L, 
between the element and the measurement point. The impedance, Z, at the measurement 
point is given by 


(3.144) 


Z,+%1Z.tank,L 
Z.—iZ,tank,L 


As it is easier to implement a stub than a gap — since stubs also provide mechanical support 
and cooling while gaps need additional support structures — any coupling element can be 
compensated by a stub and a length of transmission line [17]. 

Compensating at one frequency to get higher transmission, by cancelling the antenna’s 
reactance with a component with the opposite reactance at that frequency (normally the 
frequency of the highest shunt impedance HOM), will cause the reactance to be higher at 
other frequencies, as a capacitor’s reactance will decrease with frequency while an inductor’s 
reactance decreases with frequency. This can also result in stopbands, where no power is 
transported in a finite frequency band, due to resonances between two reactances separated 
by a distance at high frequencies where the gap is comparable to a quarter or half of the 
wavelength depending on the exact components. 

In order to filter the fundamental mode frequency we can place a gap between the stub 
and the outer conductor, giving a capacitance, C's i series with the inductance, as shown in 
Fig 3.38, with the filter centre frequency, wy given by 


Z=. (3.145) 


1 
Wwe = 
f LoC; 


(3.146) 


where Cy is the filter capacitance. The addition of this capacitance will slightly alter the 
compensation frequency as well. 

A real HOM coupler for the LHC crab cavities [43] is shown in Fig 3.39. Here a hook 
coupler is used as the coupler is placed in a region of high magnetic field but low electric 
field, with a hook chosen over a loop in order to have the couplers be demountable. A 
cylindrical electrode is placed between the inner and outer conductor, supported by a stub 
on the inner conductor, with the capacitance and inductance chosen to reject any coupling 
at 400 MHz (the cavity’s operational frequency). The inner conductor has a large radius 
and is attached to the top of the can to provide strong cooling. The coupler bends by 90° 
at the top, before having a capacitive gap between the inner conductor and the pick-up. 
By altering the distances between elements, we can create a high-pass filter and provide 
additional damping at the frequencies of the most dangerous HOMs. 
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FIGURE 3.38 Equivalent circuit of a coaxial HOM coupler. 
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Coaxial HOM coupler for the LHC double quarter-wave crab cavity. 
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TABLE 3.2 Comparison of CLIC and ILC parameters. 


Parameter Units CLIC ILC 
structure type 27/3 TWS | Coupled-cavity SW 

frequency GHz 11.9942 1.3 

gradient MV/m 100 31.5 
Epeak MV/m 250 63 

Qo 7245 >5x 10° 
shunt impedance | MQ/m 95.4 2590000 

input power MW 62.4 0.311 
cavity length m 0.233 1 
filling time us 0.066 565 
min aperture mm 2.35 70 


3.11 Cavity Geometries 


There are several different types of cavity geometry depending on the velocity and species 
of the particles to be accelerated. Some have higher shunt impedance for low particle ve- 
locities and small gaps, while others are more suited to particles travelling at virtually the 
speed of light. At low particle velocity, RF defocusing is an issue, as will be discussed in 
Chapter 5, requiring low-frequency cavities. Low frequency cavities typically require special 
cavity shapes to keep the cavity size to practical limits. In proton and ion synchrotrons it is 
necessary to change the cavity frequency as the beam is accelerated which requires cavities 
that can quickly and repeatably alter their frequency. 

We have already discussed disk-loaded structures and side-coupled standing-wave cavi- 
ties. As these operate in the 7 and bi-periodic 7/2 modes respectively, they are best suited 
to high particle velocities. As the gap length is reduced to maintain synchronism with lower- 
velocity particles the distances between the disks is reduced, as shown by Equation 3.100. 
This reduces the shunt impedance since more of the length is now taken up with the disks 
which have a finite thickness reducing the gradient, and the increased RF losses on the disks 
increases the power losses. As a result, other cavity shapes may be more effective for use 
with low-velocity protons and ions. Typically disk-loaded cavities or side-coupled cavities 
are used for particle velocities above 0.5c; however, structures have been realised at lower 
particle velocities [44]. Since we use the symbol 8 = v/c, cavities designed for low, medium 
and high particle velocities are referred to as low-beta, medium-beta and high-beta cavities. 

For lower frequencies (<200 MHz) the cavity size can be difficult to realise practically 
for disk-loaded cavities, with diameters 0.76-1 times the wavelength depending on the cell 
length and aperture size. TEM- and TE-mode cavities — which can have smaller diameters 
with respect to wavelength — may be more practical. 

The choice of a superconducting or a normal conducting cavity changes the cavity pa- 
rameters greatly. In Table 3.2 we see a comparison between the two proposed designs for the 
next big linear lepton collider, CLIC [16] and ILC [15, 45]. As can be seen the NCRF CLIC 
cavity has a gradient reach three times higher than the SRF ILC cavity and the cavity fills 
4 orders of magnitude faster, due to the higher frequency and high group velocity. However 
the ILC cavity needs 100 times less RF power and has an aperture 30 times larger, reducing 
the wakefields considerably. 


3.11.1 Elliptical Cavities 


Disk-loaded cavities have issues with multipactor at high gradient with electrons performing 
cyclotron orbits every half RF period in the magnetic field at the equator. For normal- 
conducting cavities this is not a major issue, but for superconducting cavities the heat 
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FIGURE 3.40  Five-cell elliptical cavity for LEP; image courtesy of CERN. 


generated can severely limit cavity operation. Initial SRF cavities were limited in this way 
but later cavities avoided this by using elliptical geometries as mentioned previously [26]. 
Electrons strike the surface at different angles depending on the orbit radii, causing the 
secondaries to move in successive orbits towards the centre where the electric field is zero 
and electrons cannot be accelerated to sufficient energy to create new secondary electrons. 

Initially, the equator ellipse size was limited to ensure a sloped wall angle [15], to allow 
acid and water to drain more effectively from the cavities during cleaning, but this require- 
ment is no longer felt to be necessary [46]. By varying the wall angle we can change the 
ratio between the peak surface electric and magnetic fields. Early elliptical SRF cavities 
were limited by field emission and hence a large positive slope was used to minimise the 
peak electric field. Modern cleaning methods have reduced field emission such that the cav- 
ities are now limited by magnetic field effects such as heating, and hence smaller slopes, or 
even negative slopes can be used. The elliptical cavities for LEP are shown in F 


3.11.2 RE Electron Guns 


RF guns are electron sources with a photocathode installed inside an RF cavity. The elec- 
trons will leave the cathode at an energy of a few eV, and should be accelerated as quickly 
as possible to avoid the beam being blown up by its own self-fields (so-called space-charge 
forces). Electrons will become relativistic in a single 3 GHz cell at gradients above 30 MV/m, 
hence only the first cell needs modification, normally being around a half-cell long. At higher 
frequencies further cells may need their length modified as the smaller gap means the beam 
will not be fully relativistic at the exit of the first cell. 

As the beam is at low energy it can be very sensitive to dipole or quadrupole components 
of the field caused by coupler asymmetry; these may cause the electrical centre of the cavity 
to shift off the beam axis (dipole) or cause the field to vary with radius differently in the 
horizontal and vertical planes (quadrupole). This is avoided in two ways, either by using two 
couplers to make the field symmetric to avoid dipole components (while using an elliptical 
cross section to reduce the quadrupole component), or by using a coaxial coupler inside the 
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FIGURE 3.41 RF gun for the CLARA accelerator with coaxial coupling. 


beam pipe to maintain azimuthal symmetry as shown in Fig 3.41. 

If a coaxial coupler is used it will have a door-knob transition to a waveguide away 
from the cavity. Normally this is fed from a single side, with a short circuit on the other 
side at a fixed distance to cancel out any reflections. This will excite an additional dipole 
component, but if the dipole mode is cut-off in the coaxial line, this will not be transmitted 
to the cavity. The cut-off frequency of a coaxial line, of inner conductor radius a and outer 


conductor radius b, is given by 
2c 
c= : 3.147 
arene (3.147) 
The size of the inner conductor must be large enough to allow the laser beam to be brought 
in to the cathode, and hence in many cases the dipole mode will not be cut off. In such 


cases a dual feed door-knob can be utilised as shown in Fig 3.41 [47]. 


3.11.3  Half-Wave Resonators and Spoke Cavities 


For low-energy proton and ion beams the gap must be reduced to keep the transit-time factor 
high for single cells and to maintain synchronism for multi-cell structures, as the particle 
velocity is lower. As the gap gets smaller elliptical cavities become less mechanically stiff, 
and microphonics becomes a limiting issue. A common geometry for low-beta cavities is the 
coaxial resonator, made up of a length section of coaxial line, with conducting walls at both 
ends, shown in Fig 3.42. As the electric field component parallel to the walls must be zero 
on the conducting walls at the ends, the resonator will have resonant frequencies such that 
there is an integer number of half wavelengths between the two ends in the TEM mode, 
making it smaller than an elliptical cavity at the same frequency, as it only needs to be long 
in one axis. As the resonator is operated in the TEM mode, there are no longitudinal field 
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Beam 


FIGURE 3.42 Electric field pattern inside a half-wave resonator; (top) the beam is travelling down the 
page; (bottom) the beam is travelling out of the page. 


components and hence the structure should be oriented such that the beam will travel in 
the cavities radial direction, between the outer and inner conductor, to be accelerated by 
the radial electric field. The cylindrical shape and the inner conductor provide additional 
mechanical stiffness reducing the sensitivity to microphonics. For medium-beta (8 ~ 0.4) 
the half-wave cavity starts to become less mechanically stable. This can be remedied by 
varying the outer conductor orientation, in a structure known as a spoke resonator [48]. 
In these cavities a half-wavelength rod is placed radially across a cylinder, as shown in 
Fig 3.43. These structures work well at intermediate particle velocities, 0.15 < 8 < 0.62. 
Spokes can be sensitive to multipactor; however, altering the shape of the outer conductor 
can mitigate this [49]. If multiple cells are required to maximise the voltage that can be 
obtained in a finite length, several rods can be placed inside one cylinder along the length 
creating multi-cell cavities [50]. The rods are strongly coupled to each other as there are no 
walls between them to prevent the field from one rod reach the next rod. Spoke resonators 
have also been proposed for accelerating relativistic electrons as they are smaller radially 
than elliptical cavities [51]. 


3.11.4 Quarter-Wave Cavities 


For even lower frequencies, even half-wave resonators become too large due to the need 
to be a half-wavelength long in one axis. The resonator size can be reduced by a factor 
of two in the long axis by using a quarter-wave cavity instead [53]. Quarter-wave cavities 
are again coaxial resonators; however, while one side has a conducting wall at the end, the 
other side has a gap between the inner conductor and the wall creating a capacitive loading 
of the resonator. At the capacitive gap, the potential on the inner conductor produces a 
longitudinal electric field across the gap, allowing the electric field to be maximised at one 
end in the gap and zero at the other end at the conducting wall, making the resonator 
approximately a quarter wavelength long and so making it half the transverse size of a 
half-wave resonator. If we consider that the admittance at the end of the inner conductor, 
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FIGURE 3.43  Sectioned view of a 345 MHz triple-spoke-loaded cavity for 8 = 0.5 from [52]. 


at the capacitive gap, should be zero, we can state that the admittance of the capacitor 
should be equal and opposite the admittance of the line. The impedance of the line varies 
along the line, l, as, Z(l) = Ze tan(k,l), hence the resonant frequency for a line of length L 
can be given as 


s = Zatan kL (3.148) 


where k, = w/c and C is the capacitance of the gap at the end. 

As there are electric fields in the gap between the inner conductor and the end plate as 
well as between the inner and outer conductor, a quarter-wave resonator can be oriented to 
accelerate electrons travelling either radially or longitudinally depending on if it is better 
suited to make it compact longitudinally or transversely, respectively. Where the electron 
beam travels between the inner and outer conductor, there is typically a small beam tube 
cut radially through the inner conductor near the tip of the inner conductor, creating an 
accelerating gap on either side of the inner conductor; this is shown in Fig 3.44. There will 
still be some magnetic field at the beam tube and hence care must be taken to ensure the 
beam’s trajectory isn’t disrupted due to this. The quarter wavelength is in the transverse 
direction, hence these cavities are transversely large. Quarter-wave cavities can also have 
electron beams travel longitudinally parallel to the coaxial line rather than radially, but in 
these cavities the ratio of the cavity length (around 1/4 of the wavelength) to the accelerating 
gap is large but the radius can be very small. There is a beam pipe cut into the inner 
conductor and the beam only experiences acceleration in the small gap between the inner 
conductor and the end plate, as shown in Fig 3.45. Such geometries have been proposed as 
low-frequency RF electron guns, and for very low-frequency cavities such as the 56 MHz 
cavities in RHIC [54]. 

Another example of a quarter-wave resonator is the RF system in many cyclotrons. In a 
cyclotron the acceleration takes place in the capacitive gap between two electrodes, known 
as Dees as the original designs had ‘D’-shaped electrodes. However the Dees themselves are 
not resonant structures at the low frequencies required in cyclotron RF systems; hence in 
order to have a resonant structure, the Dees are each connected to a quarter-wave resonator 


i ag DO es a ak e ai dk ae a ia ae 


The Science and Technology of Particle Accelerators 


94 


She eee E 


E Ek 
tttrtrtr err rere 


fo AH Nit fai Gh GAR, fh Bak eo. A, 
se foe Ts AE BE OE I AN E SE SD D ee An 5 


FIGURE 3.44 Electric field pattern inside a vertically-oriented quarter-wave resonator. 
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FIGURE 3.45 Electric field pattern inside a longitudinally-oriented quarter-wave resonator. 
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FIGURE 3.46 Geometry of the RF electrodes used in early cyclotrons, showing the Dees and the 
quarter-wave lines. The entire RF system and liner is inserted between the two cyclotron poles from the 
side. 


called a stem, made of a coaxial line with a short at the end and the Dees forming the 
capacitive ends. These stems can be seen in Fig 3.46 and a more modern vertical stem 
in Fig 3.47. Often these shorts are movable to provide frequency variation. In very large 
fixed-frequency cyclotrons — such as the PSI 590 MeV cyclotron [55] — the quarter-wave 
resonators are sometimes replaced with large waveguide resonators. It is also possible to 
use a double-gap system such that the central electrode can support a TEM mode, and the 
electrodes are made to be a half-wavelength long [56]. 

In proton and ion synchrotrons the RF frequency, and hence the cavity frequency, has 
to change as the beam is accelerated and as the revolution time changes. One method of 
doing so is to load the cavity with a ferrite, which is a ferromagnetic material with lower RF 
losses [57]. A ferrite has a permeability that varies as a function of applied magnetic field. 
An electromagnet can be used to bias the ferrite, changing the permeability and hence 
the resonant frequency. These are typically longitudinally-oriented quarter-wave cavities 
with rings of ferrite placed in the base of the cavity where the magnetic field is strongest. 
Amorphous and nano-crystalline magnetic alloy materials can also be used that have much 
higher permeability and a much lower Q. The low Q gives a wide frequency range such that 
tuning may not be required. 

A comparison of quarter-wave, half-wave, spoke and elliptical cavities/resonators is 
shown in Fig 3.48 showing the particle velocity and frequency range where each is most 
effective. Generally, higher frequency cavities are preferred as they are smaller, however one 
would not use a low-beta cavity at high frequency due to transverse defocusing as discussed 
in Chapter 5. A high cavity frequency may also limit the beam pipe aperture creating 
stronger wakefields. 


3.11.5 Drift-Tube Linacs (DTLs) 


A fraction of the RF losses in a disk-loaded waveguide occurs on the disks. The distance 
between the centres of any two disks is proportional to the beam velocity for a fixed phase 
advance, and so the number of disks per metre, and hence the RF losses on the disks, 
increases as the beam energy decreases. At a certain point disk-loaded waveguides start 
to become very inefficient and hence accelerating cavities without disks are required. In 
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FIGURE 3.47 Illustration of the Dees and RF liner in a modern AVF cyclotron that includes pole hills 
and valleys. On the left is shown the entire yoke, pole and RF system. On the right can be seen the 3 Dees 
with vertical stems, surrounded by liners situated in the valleys. 
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FIGURE 3.48 The frequency and particle velocity range for each cavity type from [58]. 
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drift-tube linacs the distance between gaps is longer than the distance the beam travels in 
a half RF period and hence the beam would normally be in the gap during the decelerating 
phases. To avoid the beam being decelerated, the particles need to be shielded from the 
RF fields during phases that would decelerate the beam by having the beam pass through 
small aperture beam pipes, referred to as drift-tubes, as previously mentioned. Hence these 
devices are known as drift-tube linacs (DTLs). 

There are two standard drift-tube types: 


e Widerge linacs operate at very low frequencies in a TEM mode, with every drift-tube 
held at the opposite potential from the drift-tube at either side, which alternates in 
time such that the RF is accelerating when the beam arrives at the gap between drift 
tube; 


e Alvarez linacs have a TMoi9 mode in a long (compared to the distance the beam will 
travel in an RF period) cylindrical tank with several drift tubes separated by an integer 
number of wavelengths, shown in Fig 3.49. 


In both cases as the beam is shielded from the RF most of the time, the gradient and 
shunt impedance is low, as well as the gradient which is typically 5-10 MV/m, but at low 
beam velocity they perform well compared to other cavities. An Alvarez DTL typically has 
a shunt impedance of about 50 MQ/m at 20 MeV, which drops to around 20 MQ/m at 
200 MeV [59] whilst a disk-loaded cavity typically has a higher shunt impedance at higher 
energies. The drift tubes are supported by stalks, which in the case of Alvarez linacs, are 
a quarter wavelength long making them resonant, which makes the fields less sensitive to 
variations in dimensions. In order to focus the beam, quadrupole magnets can be placed 
inside some of the drift tubes. As the beam velocity increases, it is possible to use a hybrid 
geometry, where a coupled cavity linac can have one or two drift tubes inserted inside 
each cell, giving a higher shunt impedance at intermediate beam energies. Such cavities are 
known as coupled-cavity, drift-tube linacs (CC-DTLs). 

Typically, Widerge linacs are used for very-low-velocity particles like heavy ions, where 
we need a very low frequency to reduce RF defocusing (see Chapter 7). Alvarez DTLs 
are commonly used for proton machines at intermediate particle velocities — like Linac4 at 
CERN - for particle velocities in the range 0.05 < 6 < 0.5. The Alvarez DTL in CERN’s 
Linac4 operates at a frequency of 352 MHz and this requires a tank diameter of 500 mm. 
It is subdivided into three tanks and is 19 m long in total, with about 110 drift tubes to 
accelerate protons from 3 MeV to 50 MeV (£ varies from 0.08 to 0.31), taking the cell length 
from 68 mm at the entrance to 264 mm at the exit [60]. 


3.11.6 TE Mode Linacs 


In order to accelerate particles, we require a longitudinal electric field, hence TE modes 
(known as H modes in some countries) in constant cross-section cavities cannot be used 
to accelerate charged particles. However, by inserting crossbars, shown in Fig 3.50, inside 
a resonant tank we may perturb the fields to give them a local longitudinal electric field 
near the crossbar. As it only extends a short distance from the crossbar these structures, 
known as CH (‘crossbar H-mode’) structures, are only useful at very low velocities. A similar 
device uses interdigital stalks, shown in Fig 3.51, to achieve the same effect; this is known 
as an IH (‘interdigital H-mode’) structure. These structures are smaller than Alvarez DTLs 
for a given frequency, which is useful when using very-low-frequency systems where the 
wavelength can be several metres in size. 

TE modes typically have lower surface magnetic fields than the TMj19 mode and hence 
ohmic losses are reduced allowing very high shunt impedance, close to 100 MQ/m at low 
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FIGURE 3.50 Superconducting CH (crossbar H-mode) resonator from [61]. 
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FIGURE 3.51 IH (interdigital H-mode) resonator for ISAC radioactive ion beam facility at TRIUMF; 
[62] 


particle velocities; however, this drops sharply with increasing particle velocity. For this 
reason CH structures are only used for 6 < 0.3 [63]. 


3.11.7 Radio-Frequency Quadrupoles (RFQs) 


For very-low-energy hadrons (such as protons) the space charge in the beam at low energy 
can blow the beam apart; hence, for intense beams we need to focus, bunch, and accelerate 
the particles at the same time. As we will see in Chapter 5, RF focusing and bunching 
occur at opposite phases, such that focusing phases are debunching and defocusing phases 
are bunching. Focusing of the beam can be achieved, while simultaneously accelerating the 
beam, by using electrostatic quadrupoles; four electrodes are used, with each electrode hav- 
ing the opposite potential to the electrodes on either side of it. This creates a focusing 
electric field in one plane (say, x), and defocusing in the other plane. The focusing plane is 
alternated by the longitudinal oscillation of the wave on the line to achieve a net focusing 
effect in both planes. If we have a corrugation on the surface of each electrode but with a 
longitudinal separation in the peaks, we can create an additional longitudinal field compo- 
nent, which can be used to bunch and accelerate. Such a device is called a radio-frequency 
quadrupole (RFQ). There are two types of RFQ: vane and rod. A four-vane RFQ operates 
in a TE mode, with azimuthal index m = 2 giving a quadrupole mode. These are simpler 
to manufacture but are only of feasible size at higher frequencies above 200 MHz. The 
four-vane RFQ for Linac4 at CERN is shown in Fig 3.52. A 4-rod RFQ has a corrugated 
longitudinal rod as each electrode, allowing the structure to operate in a TEM mode mak- 
ing the transverse size independent of the operating frequency; hence these structures are 
mostly used where lower frequencies are required [30]. 
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FIGURE 3.52 Four-vane RFQ for Linac4 at CERN; image courtesy of CERN. 


3.12 RF Sources 


RF structures require input powers between tens of kW to tens of MW to reach gradients of 
tens of MV/m. Typically, for large accelerators, the RF source is always an amplifier where 
the output signal is a higher-power copy of the input signal. Unlike low-power RF oscillators, 
high-power oscillators are typically not stable enough to use when two or more sources are 
required to be combined or synchronised, although phase-locked oscillators are possible for 
long pulses but are rarely used. This is often due to electron loading or thermal effects 
at high-power. For small industrial accelerators — where there is no need to synchronise — 
oscillators can be used. Typically an RF system will comprise a high power RF (HPRF) 
amplifier and a low-level RF system (LLRF), which will take feedback from the cavity and 
send the correct drive signal to the amplifier to keep the cavity voltage at the setpoint 
voltage and phase. 
RF amplifiers are typically characterised by a few key parameters: 


e Saturated output power: This is the maximum RF power an RF source can produce 
when overdriven. No accelerators operate at this power level as the control system will 
need to increase and decrease power to keep the cavity voltage constant in the presence 
of disturbances, so typically the operating power is 1 to 3 dB less than the saturated 
output power. 


e Gain: This is the ratio of RF output power to RF input power, typically expressed 
in dB. In many devices this is constant at low to intermediate power but decreases 
with increasing output power. The 1 dB compression point is the output power where 
the gain is reduced by 1 dB from the gain at intermediate powers. Gain is given by 
G = 10log Pout/Pin. Typically an amplifier will not have enough gain to go from the 
LLRF power to the operating power in a single stage, hence a series of lower power 
amplifiers are often required. 


e RF efficiency: This is the ratio of the saturated RF output power to the DC input 
power. High-power RF sources typically operate at efficiencies at or below 50%, sig- 
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nificantly increasing the electricity costs of the facility. Any remaining energy in the 
electron beam must be dissipated in a load known as a collector. Proposed future high- 
energy lepton colliders have a total RF power usage of 100-180 MW so the difference 
between a 40% efficient and an 80% efficient amplifier has a major impact on running 
costs. 


e Harmonic content: This is the ratio of the output power at the design frequency to 
the output power at the harmonics of the drive frequency. This is measured in dBc 
(decibels relative to the carrier). 


High-power RF sources for accelerators come in many varieties, and different types depend- 
ing on the frequency and power required [64]. 


3.12.1 Gridded Tubes 


In these devices a biased metal wire grid is placed close to the cathode in a vacuum diode. 
As the bias grid is closer to the cathode than the anode, it can create the same electric field 
with a lower voltage and can hence control the space-charge limited emission current in 
time by varying the bias voltage, with emitted electrons being accelerated to the full anode- 
cathode voltage. The wires in the grid are thin to avoid intercepting the electrons. Typical 
devices are triodes and tetrodes (which include a 4th screening grid), shown in Fig 3.53. 
These devices are typically low gain, around 13 dB, and have issues at higher frequencies 
as the electron must pass the cathode-grid gap in a half RF period, and hence tend to be 
used below 500 MHz. 

A more efficient coupling from the beam to the RF can be obtained by replacing the 
anode with a resonant cavity with a high shunt impedance. This type of device is known as 
an inductive output tube (IOT) and is shown in 3.54. These devices have more gain than 
a tetrode (20-30 dB) and operate up to ~3 GHz but tend to have relatively low output 
powers of under 100 kW. 

In all gridded tubes the grid can be DC biased to change the current waveform, known 
as different amplifier classes. If the DC and RF voltages are equal, known as class A, the 
device conducts at all phases providing perfect sinusoidal current profiles, but at the cost 
of efficiency due to the large DC component. If there is no DC bias, known as class B, the 
current waveform is a half-wave rectified sinusoid, and hence has higher harmonics of the 
RF frequency. In class C amplifiers a negative DC bias is used so the device only conducts 
for a small fraction of the RF period, giving even more harmonics but highest efficiencies. 


3.12.2 Klystrons 


To obtain higher powers and/or frequencies we cannot utilise a grid. The grid can be avoided 
by utilising velocity bunching where a DC beam traverses an input RF cavity which acceler- 
ates some electrons and decelerates others. As the electron beam travels down the beampipe 
the faster particles catch up with the slower ones forming discrete bunches. This effect can 
be enhanced by using several additional intermediate bunching cavities, which are excited 
by the bunched beam but phased to provide further bunching, as shown in Fig 3.55. When 
fully bunched the beam passes through an output cavity, tuned to maximise the power 
output, and is then dumped [64]. Klystrons can provide powers of tens of MW up to fre- 
quencies of tens of GHz, and have very high gain (~ 50 dB). However, velocity bunching isn’t 
perfect, and due to the requirement to operate below maximum power to allow overhead 
for RF control, klystrons do not operate at very high efficiencies (~30-40% at operation). 
More recently developments have investigated high efficiency klystrons providing maximum 
efficiencies of 80% [65]. The lower the beam current for a given voltage, the higher the 
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FIGURE 3.53 Layout of a tetrode gridded tube. 
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FIGURE 3.54 Diagram of an IOT. 
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FIGURE 3.55 Geometry of a klystron. 


efficiency so splitting the beam current into several lower current beamlets that traverse the 
same cavities, known as a Multi-Beam Klystron (MBK), can provide higher efficiencies. As 
klystrons can be quite long a solenoid magnet is required to confine the beam. High power 
klystrons can require very high cathode voltages up to 500 kV, requiring the high voltage 
end to be operated inside an oil tank to prevent arcing. 


3.12.3 Solid-State Power Amplifiers 


For other lower-power applications, semiconductor transistor amplifiers are commonly used, 
but these are limited to around 100 W for laterally diffused metal oxide semiconductors 
(LDMOS). For high average power applications, thousands of these LDMOS transistors can 
be combined to achieve hundreds of kW of RF power. At higher frequencies, above 1 GHz, 
GaN transistors are preferred for their higher efficiency. The big advantage of solid-state 
amplifiers is that the transistors fail gradually and with regular maintenance the amplifier 
can be made to run without downtime. This is particularly important for 3rd generation 
light sources. They also operate at much lower voltages and can be air cooled. As the size 
and cost is dominated by the peak power, such devices are large and not cost effective for 
short pulse, high peak power applications. 


3.12.4 Magnetrons 


Most particle accelerators require several RF cavities to be individually powered, but each 
cavity should be synchronised with the beams arrival time, meaning that only amplifiers 
where the phase can be tighly controlled can be used. For applications requiring only a 
single RF structure with a DC electron beam, oscillators can be used instead as there 
is no requirement for synchronising. The magnetron is the most common high-power RF 
oscillator, due to its compact size, low cost, and high efficiency, and is commonly utilised 
in industrial and medical linacs. In a magnetron, electrons are launched from the cathode 
at the inner conductor of a coaxial line. The electrons are made to follow circular orbits 
due to an external axial magnetic field. The magnetic field is chosen so the electrons fall 
just short of the anode/outer conductor and return to the cathode. Interaction with an RF 
field means that electrons that are decelerated in the first half cycle (hence giving energy to 
the RF field) will lose energy and will have a larger cyclotron radius and will hence hit the 
anode, rather than returning to the cathode and being accelerated on the 2nd half cycle. To 
enhance the process the anode is formed into a series of resonant cavities, with the use of 
vanes. The electrons that gain energy from the RF form a cloud around the cathode known 
as the sub-synchronous zone while the electrons losing energy to the RF form spokes as 
can be seen in Fig 3.56. Magnetrons for accelerators operate up to 9.3 GHz and provide 
a few MW of RF power but their oscillation frequency can vary by 0.1 % due to thermal 
expansion, reflected power, magnetic field changes and power supply ripple. For this reason 
they are typically only used to drive single cavity industrial and medical linacs where the 
drift in drive frequency is not an issue. 
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FIGURE 3.56 A vaned magnetron showing the electron beam spokes. 


It is possible to seed a magnetron such that the oscillations will phase-lock to an ex- 
ternally injected RF signal, however due to the frequency variations mentioned above, it 
takes a significant amount of RF power to lock a magnetron. More recently there has been 
research into providing feedback to reduce the frequency variation by altering the power 
supply or the magnetic field [66]. The feedback means that the magnetron will phase lock 
at reduced input powers, opening the door to cheaper RF sources for high average power 
applications. 


3.12.5 Dielectric Laser Accelerators 


To achieve higher gradient we can use higher-frequency RF, moving to millimetre waves, 
THz frequencies or even higher. The breakdown rate is known to scale with frequency and 
pulse length, both of which allow higher field strengths at higher frequencies. However, as 
the frequency increases, the wavelength decreases, making the structures much smaller and 
harder to manufacture. At smaller wavelengths, a particle bunch will cover a wider range 
of RF phases making capture of electron beams more difficult. There are several methods 
of interacting with high-frequency accelerators: 


e diffraction gratings [67], 

e inverse free-electron laser [68], 

e scaled-down RF cavities [69], 

e photonic bandgap structures [70], 
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e waveguide loaded with dielectric or corrugations [31] [71]. 


Significant gradients, above 300 MV/m, have already been achieved at optical and in- 
frared wavelengths [67], but as the beam is often longer than the wavelength, the beam 
obtains a large energy spread. To avoid this we need to go to longer wavelengths, with 
the ideal wavelength being around 0.6 mm (i.e. a frequency of around 0.5 THz) [31]. At 
this frequency, high-power radiation sources are typically of wide bandwidth and hence 
the structures also need to be wideband to utilise all the THz pulse energy. The band- 
width is limited by the fact that not all frequencies will travel at the beam velocity, and 
hence the pulse will slip out of synchronism with the beam at higher and lower frequencies. 
Making cavity-like structures is difficult at higher frequencies, so other ways are required 
to maintain synchronism; these include dielectric-loaded waveguide, corrugated waveguide, 
and all-dielectric accelerators made from photonic bandgap structures. The sources to drive 
the fields include lasers and Cerenkov generation in non-linear crystals, but they can also 
be excited by the beam in a dielectric wakefield accelerator [72]. Here, either the head of a 
bunch is decelerated and the tail accelerated or one drive bunch drives a wake to accelerate 
a separate witness bunch. 


3.12.6 Plasma Accelerators 


Another method to achieve gradients beyond the breakdown limit of copper at microwave 
frequencies is to use plasma to accelerate particles. A strong electromagnetic field coming 
from either an intense laser [73] or a charged particle beam [74] creates a channel in the 
plasma where the electrons are repelled, or at lower intensities, a displacement of electrons 
occurs. As the electrons return to the channel, attracted by the positive charge generated 
by the newly created ions, they develop a large travelling electric field which can be used to 
accelerate a short bunch of electrons. Such an accelerator can generate very high gradients 
in the GV/m range; however, issues remain in trying to achieve beams of sufficient quality 
to be utilised for most applications. Technical issues also need to be solved around laser 
efficiency, the ability to use more than one acceleration stage, stability, and increasing 
average beam power. The concept was originally devised by Tajima and Dawson in 1979 
and was experimentally verified by Joshi in 1984. The current record generates 7.8 GeV 
in 20 cm providing a gradient of 39 GV/m using a petawatt laser [75]. The energy gain is 
inversely proportional to the plasma density, no, as the gradient scales with \/no and the 


laser depletion length scales as ng 3/2 More recently, laser heating techniques have sought 
to circumvent this, in which case the interaction length is limited by dephasing between the 
wake and the accelerated electron beam. To increase the acceleration length in a plasma, 
the AWAKE collaboration has demonstrated the use of energetic proton beams from the 
super proton synchrotron (SPS) at CERN to drive a wake in a 10 m plasma, which then 
accelerated an injected electron beam from 19 MeV to 2 GeV. Such a concept could be 
extrapolated to use the 13 TeV proton beam from the LHC to create a TeV-scale electron 
collider [74]. 
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Exercises 


1. A cathode operating in temperature-limited thermonic emission has a diameter of 
3 cm and a work function of 2 eV. Calculate the temperature of the cathode to have an 
emission current of 200 mA, and the minimum voltage required to ensure temperature- 
limited emission. 


2. Derive the shunt impedance per unit length for a pillbox cavity at 12 GHz, where the 
cavity is designed to accelerate electrons travelling at G=1. 


3. Ifa cavity has a shunt impedance of 100 MQ and has an accelerating voltage of 100 MV, 
calculate the power required to accelerate a beam current of 10 mA. 


4. A 1.3 GHz cavity has an ohmic Q factor of 101°, an external Q factor of 10° and is 
driven by a 10 kW RF source. What is the stored energy in the cavity, and the reflected 
power for a steady-state situation? 


5. A 2/2 side-coupled structure at 3 GHz has 15 accelerating cells and 14 coupling cells. If 
there is a 1 MHz difference between the accelerating and side-coupled cell frequencies 
calculate the coupling required to have the accelerating fields in each cell within 1% of 
each other. 


6. A 60 cell, 12 GHz, 27/3 constant-impedance travelling-wave structure is fed with a 
10 MW amplifier and each cell has a Qo of 10,000. Calculate the group velocity required 
to maximise the accelerating voltage, and the external Q of the couplers. 


7. Calculate the maximum surface magnetic field due to pulsed RF heating on a 12 GHz 
copper cavity, if the RF pulse duration is 1 ps. 

8. A500 MHz niobium SRF cavity is cooled to a temperature of 4.2 K. If it has a geometry 
factor, G, of 100 Q and a residual resistance of 5 nQ, what is the ohmic Q of the cavity? 


9. A probe-type HOM coupler has a coupling capacitance of 5 pF. If it is mounted on a 
1.3 GHz cavity, with the highest-impedance HOM at 2.5 GHz, design a circuit to filter 
the operating mode’s frequency and compensate at the HOM frequency; calculate the 
values of any capacitors or inductors used. 


References 


1. E Skaria and BM Varghese. DC-DC booster with cascaded connected multilevel voltage 
multiplier applied to transformer less converter for high power applications. J. Electr 
& Electron. Eng, 9(5):73-78, 2014. 

2. JD Cockcroft and ETS Walton. Experiments with high velocity positive ions. (i) further 
developments in the method of obtaining high velocity positive ions. Proceedings of 
the Royal Society of London. Series A, Containing Papers of a Mathematical and 
Physical Character, 136(830):619-630, 1932. 

3. C. Young, M. Chen, T. Chang, C. Ko, and K. Jen. Cascade cockcroft”’ walton voltage multi- 
plier applied to transformerless high step-up dc“dc converter. IEEE Transactions on 
Industrial Electronics, 60(2):523-537, Feb 2013. 

4. MA Kemp. Solid-state Marx Modulators for Emerging Applications. 2012. 

5. YY Lau, Youfan Liu, and RK Parker. Electron emission: From the fowler—nordheim relation 
to the child-langmuir law. Physics of Plasmas, 1(6):2082—2085, 1994. 

6. T Siggins, C Sinclair, C Bohn, D Bullard, David Douglas, A Grippo, J Gubeli, GA Krafft, and 
B Yunn. Performance of a DC GaAs photocathode gun for the Jefferson lab FEL. Nu- 
clear Instruments and Methods in Physics Research Section A: Accelerators, Spec- 
trometers, Detectors and Associated Equipment, 475(1-3):549-553, 2001. 


Acceleration 107 


7. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


Richard G Forbes and Jonathan HB Deane. Reformulation of the standard theory of fowler- 
nordheim tunnelling and cold field electron emission. Proceedings of the Royal Society 
A: Mathematical, Physical and Engineering Sciences, 463(2087):2907-2927, 2007. 

K.L Jensen, Y.Y Lau, D.W Feldman, and P.G O’Shea. Electron emission contributions to 
dark current and its relation to microscopic field enhancement and heating in accelerator 
structures. Physical Review Special Topics: Accelerators and Beams, 11(8):081001, 
2008. 

D Faircloth. Ion sources for high-power hadron accelerators. arXiv preprint 


arXiv:1802.8745, 2013. 


. WD Kilpatrick. Criterion for vacuum sparking designed to include both RF and DC. Review 


of Scientific Instruments, 28(10):824-826, 1957. 


. PF Dahl. Rolf widere: Progenitor of particle accelerators. SS'CLI-SR-lU86, 1992. 
. LW Alvarez, H Bradner, JV Franck, H Gordon, JD Gow, LC Marshall, F Oppenheimer, WKH 


Panofsky, C Richman, and JR Woodyard. Berkeley proton linear accelerator. Review 
of Scientific Instruments, 26(2):111-133, 1955. 


. DW Fry, RBR-S Harvie, LB Mullett, and W Walkinshaw. A travelling-wave linear accelerator 


for 4-MeV electrons. Nature, 162(4126):859, 1948. 


. EL Chu and W.W Hansen. Disk-loaded wave guides. Journal of Applied Physics, 20(3):280- 


285, 1949. 


. Bernard Aune, R Bandelmann, D Bloess, B Bonin, A Bosotti, M Champion, C Crawford, 


G Deppe, B Dwersteg, DA Edwards, et al. Superconducting TESLA cavities. Physical 
Review Special Topics: Accelerators and Beams, 3(9):092001, 2000. 

M. Aicheler, P. Burrows, M. Draper, T. Garvey, P. Lebrun, K. Peach, N. Phinney, H. Schmick- 
ler, D. Schulte, and N. Toge, editors. A multi-TeV linear collider based on CLIC 
technology: CLIC Conceptual Design Report, volume CERN-2012-007. 2012. 

D.M Pozar. Microwave Engineering; 3rd ed. Wiley, Hoboken, NJ, 2005. 

H. Padamsee, J Knobloch, T Hays, et al. RF Superconductivity for Accelerators, volume 
2011. Wiley Online Library, 2008. 

TJ Boyd Jr. Kilpatrick’s criterion. Los Alamos Group AT-1 Report, 82:28, 1982. 

Alberto Degiovanni, Walter Wuensch, and Jorge Giner Navarro. Comparison of the condi- 
tioning of high gradient accelerating structures. Phys. Rev. Accel. Beams, 19:032001, 
Mar 2016. 

A. Grudiev, S. Calatroni, and W. Wuensch. New local field quantity describing the high 
gradient limit of accelerating structures. Phys. Rev. ST Accel. Beams, 12:102001, Oct 
2009. 

F. Djurabekova, S Parviainen, A Pohjonen, and K Nordlund. Atomistic modeling of metal 
surfaces under electric fields: Direct coupling of electric fields to a molecular dynamics 
algorithm. Physical Review E, 83(2):026704, 2011. 

K. Nordlund and F. Djurabekova. Defect model for the dependence of breakdown rate on 
external electric fields. Physical Review Special Topics: Accelerators and Beams, 
15(7):071002, 2012. 

M. A. Furman and M. T. F. Pivi. Probabilistic model for the simulation of secondary electron 
emission. Physical Review Special Topics: Accelerators and Beams, 5:124404, Dec 
2002. 

G. Burt and A. C. Dexter. Prediction of multipactor in the iris region of rf deflecting mode 
cavities. Physical Review Special Topics: Accelerators and Beams, 14:122002, Dec 
2011. 

Pasi Yla-Oijala. Electron multipacting in TESLA cavities and input couplers. Particle 
Accelerators, 63:105-137, 1999. 

D.P Pritzkau and R.H Siemann. Experimental study of RF pulsed heating on oxygen free elec- 
tronic copper. Physical Review Special Topics: Accelerators and Beams, 5(11):112002, 
2002. 


108 The Science and Technology of Particle Accelerators 


28. 


29 


30 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


M. Jenkins, G Burt, AV Praveen Kumar, Y. Saveliev, P. Corlett, T. Hartnett, R Smith, 
A Wheelhouse, P McIntosh, and K Middleman. Prototype 1 MeV X-band linac for 
aviation cargo inspection. Physical Review Accelerators and Beams, 22(2):020101, 
2019. 

S. Pitman. Optimisation studies for a high gradient proton Linac for application in 
proton imaging: ProBE: Proton Boosting Linac for imaging and therapy. PhD thesis, 
Lancaster University, 2019. 

T. P. Wangler. RF Linear Accelerators, Second Edition. Wiley, 2008. 

MT Hibberd, AL Healy, DS Lake, V Georgiadis, EJH Smith, OJ Finlay, TH Pacey, JK Jones, 
Y Saveliev, DA Walsh, et al. Terahertz-driven acceleration of a relativistic 35 MeV 
electron beam. In 2019 44th International Conference on Infrared, Millimeter, and 
Terahertz Waves (IRMMW-THz), pages 1-2. IEEE, 2019. 

C. Nantista, S. Tantawi, and V. Dolgashev. Low-field accelerator structure couplers and design 
techniques. Physical Review Special Topics: Accelerators and Beams, 7(7):072001, 
2004. 

N. M. Kroll, C. K. Ng, and D. C. Vier. Applications of time domain simulation to coupler 
design for periodic structures. In Proceedings of 20th International Linac Conference, 
Linac 2000, Monterey, USA, pages 614-617. 

K Pepitone, S Doebert, G Burt, E Chevallay, N Chritin, C Delory, V Fedosseev, Ch Hessler, 
G McMonagle, Oznur Mete, et al. The electron accelerator for the AWAKE experi- 
ment at CERN. Nuclear Instruments and Methods in Physics Research Section A: 
Accelerators, Spectrometers, Detectors and Associated Equipment, 829:73-75, 2016. 

S Benedetti, A Grudiev, and A Latina. High gradient linac for proton therapy. Physical 
Review Accelerators and Beams, 20(4):040101, 2017. 

WJ Gallagher. Design of travelling wave electron linear accelerators. In IEEE Transactions 
on Nuclear Science, number 3, page 282, 1967. 

A Grassellino, A Romanenko, D Sergatskov, O Melnychuk, Y Trenikhina, A Crawford, 
A Rowe, M Wong, T Khabiboulline, and F Barkov. Nitrogen and argon doping of 
niobium for superconducting radio frequency cavities: a pathway to highly efficient ac- 
celerating structures. Superconductor Science and Technology, 26(10):102001, aug 
2013. 

T. Junginger. EuCARD-BOO-2012-004, 2012. 

J-M Vogt, O Kugeler, and J Knobloch. High-Q operation of superconducting RF cavities: 
Potential impact of thermocurrents on the RF surface resistance. Physical Review 
Special Topics-Accelerators and Beams, 18(4):042001, 2015. 

D Reschke et al. Challenges in SRF module production for the European XFEL. In Pro- 
ceedings of the 15th International Workshop on RF Superconductivity, Chicago, IUl., 
USA, 2011. 

D. Broemmelsiek, B. Chase, D. Edstrom, E. Harms, J. Leibfritz, S. Nagaitsev, Y. Pischalnikov, 
A. Romanov, J. Ruan, W. Schappert, et al. Record high-gradient SRF beam acceleration 
at Fermilab. New Journal of Physics, 20(11):113018, 2018. 

A. Macpherson, K. Hernndez-Chahn, C. Jarrige, P. Maesen, F. Pillon, K. Schirm, R. Torres- 
Sanchez, and N. Valverde Alonso. CERN’s bulk niobium high gradient SRF programme: 
developments and recent cold test results. page MOPB074. 5 p, 2015. 

J. Mitchell. DQW Crab Cavity HOMs and Dampers for the HL-LHC. Lancaster University, 
2019. 

U. Amaldi, P Berra, K Crandall, D Toet, M Weiss, R Zennaro, E Rosso, B Szeless, M Vretenar, 
C Cicardi, et al. LIBO ~ a linac-booster for protontherapy: Construction and tests 
of a prototype. Nuclear Instruments and Methods in Physics Research Section A: 
Accelerators, Spectrometers, Detectors and Associated Equipment, 521(2-3):512-529, 
2004. 


Acceleration 109 


45. 


46 


47. 


48. 


49. 


50. 


5l. 
52. 
53. 
54. 


55. 


56. 


57. 


58. 


59. 
60. 


61. 


62. 


63 


et al Adolphsen C. The International Linear Collider Technical Design Report: Volume 
3. II: Accelerator Baseline Design. 2013. 

J. Sekutowicz, K Ko, L Ge, L Lee, Zenghai Li, C Ng, G Schussman, Liling Xiao, I Gonin, 
T Khabibouline, et al. Design of a low loss SRF cavity for the ILC. In Proceedings of 
the 2005 Particle Accelerator Conference, pages 3342-3344. IEEE, 2005. 

B. Militsyn, L. Cowie, P. Goudket, J. McKenzie, and A. Wheelhouse. Design of the high 
repetition rate photocathode gun for the clara project. In Proceedings of Linac2014, 
Geneva, Switzerland, 2014. 

G Olry, JL Biarrotte, S Blivet, S Bousson, F Chatelet, T Junquera, A Le Goff, J Lesrel, 
C Milot, AC Mueller, et al. Development of SRF spoke cavities for low and interme- 
diate energy ion linacs. In Proceedings of the 9th International Workshop on RF 
Superconductivity, volume 3, page 76, 2003. 

Z Yao, RE Laxdal, B Matheson, BS Waraich, and V Zvyagintsev. Design and fabrication of 
balloon single spoke resonator. In Proceedings of the 18th International Workshop on 
RF Superconductivity, 2017. 

G Apollinari, I Gonin, T Khabiboulline, G Lanfranco, F McConologue, G Romanov, and 
R Wagner. Design of 325 MHz single and triple spoke resonators at FNAL. [EEE 
Transactions on Applied Superconductivity, 17(2):1322-1325, 2007. 

C.S Hopper and J.R Delayen. Superconducting spoke cavities for high-velocity applications. 
Physical Review Special Topics: Accelerators and Beams, 16(10):102001, 2013. 

Michael Kelly. Superconducting spoke cavities. 2006. 

J. Delayen. Low and intermediate beta cavity design-a tutorial. Technical report, 2003. 

D Naik and I Ben-Zvi. Suppressing multipacting in a 56 mhz quarter wave resonator. Physical 
Review Special Topics: Accelerators and Beams, 13(5):052001, 2010. 

W Wagner, M Seidel, E Morenzoni, F Groeschel, M Wohlmuther, and M Daum. Psi status 
2008: Developments at the 590 mev proton accelerator facility. Nuclear Instruments 
and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors 
and Associated Equipment, 600(1):5—-7, 2009. 

CERN Accelerator School: Cyclotrons, Linacs and their Applications. Proceedings, ed- 
itor=Turner, S, year=1996, institution=European Organization for Nuclear Re- 
search. 

ISK Gardner. CERN Accelerator School on RF engineering for particle accelerators. 350, 
1992. 

M Kelly. Superconducting radio-frequency cavities for low-beta particle accelerators. Reviews 
of Accelerator Science and Technology, 5:185-203, 2012. 

F. Gerigk. Cavity types. arXiv preprint arXiv:1111.4897, 2011. 

S Ramberger, N Alharbi, P Bourquin, Y Cuvet, F Gerigk, AM Lombardi, E Sargsyan, 
M Vretenar, and A Pisent. Drift tube linac design and prototyping for the cern linac4. 
proc. Linac08, 2008. 

Chuan Zhang, Michael Busch, Florian Dziuba, Horst Klein, Holger Podlech, and Ulrich 
Ratzinger. Recent studies on a 3-17mev dtl for eurotrans with respect to rf structures 
and beam dynamics. IPAC 2010 - 1st International Particle Accelerator Conference, 
01 2010. 

A.K. Mitra, Pierre Bricault, Ken Fong, R.E. Laxdal, Raymond Poirier, and A. Vasyuchenko. 
Rf measurement summary of isac dlt tanks and dtl bunchers. pages 951 — 953 vol.2, 02 
2001. 

G. Clemente, U. Ratzinger, H. Podlech, L. Groening, R. Brodhage, and W. Barth. Devel- 
opment of room temperature crossbar-h-mode cavities for proton and ion acceleration 
in the low to medium beta range. Physical Review Special Topics: Accelerators and 
Beams, 14(11):110101, 2011. 


1 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


74. 


75 


10 The Science and Technology of Particle Accelerators 


R.G Carter. Microwave and RF Vacuum Electronic Power Sources. Cambridge University 
Press, 2018. 

D. Constable, A. Baikov, G. Burt, I. Guzilov, V. Hill, A. Jensen, R. Kowalczyk, C. Lingwood, 
R. Marchesin, and C. et al Marrelli. High efficiency klystron development for particle 
accelerators. In 58th ICFA Advanced Beam Dynamics Workshop on High Luminosity 
Circular e* e~ Colliders (eeFACT’16), Daresbury, UK, October 24-27, 2016, pages 
185-187, 2017. 

A. C. Dexter, G. Burt, R. G. Carter, I. Tahir, H. Wang, K. Davis, and R. Rimmer. First 
demonstration and performance of an injection locked continuous wave magnetron to 
phase control a superconducting cavity. Physical Review Special Topics: Accelerators 
and Beams, 14:032001, Mar 2011. 

EA Peralta, K Soong, RJ England, ER Colby, Z Wu, B Montazeri, C McGuinness, J McNeur, 
KJ Leedle, D Walz, et al. Demonstration of electron acceleration in a laser-driven 
dielectric microstructure. Nature, 503(7474):91, 2013. 

E. Curry, S Fabbri, J Maxson, P Musumeci, and A Gover. Meter-scale terahertz-driven 
acceleration of a relativistic beam. Physical Review Letters, 120(9):094801, 2018. 

M. Fakhari, A. Fallahi, and F.X Kartner. THz cavities and injectors for compact electron 
acceleration using laser-driven THz sources. Physical Review Accelerators and Beams, 
20(4):041302, 2017. 

J.R. England, R.J Noble, K. Bane, David H Dowell, C. Ng, J.E Spencer, S. Tantawi, Z. Wu, 
R.L Byer, and E. et al Peralta. Dielectric laser accelerators. Reviews of Modern Physics, 
86(4):1337, 2014. 

A. L. Lake D. S. Georgiadis V Smith E. J. H. Finlay O. J. Pacey T. H. Jones J. K. Saveliev 
Y. Walsh D. A. Snedden E. W. Appleby R. B. Burt G. Graham D. M. Jamison S. P. 
AU Hibberd, M. T. Healy. Acceleration of relativistic beams using laser-generated 
terahertz pulses. Nature Photonics, Aug 2020. 

BD O’Shea, G Andonian, SK Barber, KL Fitzmorris, S Hakimi, J Harrison, PD Hoang, 
MJ Hogan, B Naranjo, OB Williams, et al. Observation of acceleration and deceler- 
ation in gigaelectron-volt-per-metre gradient dielectric wakefield accelerators. Nature 
Communications, 7:12763, 2016. 

S.M Hooker. Developments in laser-driven plasma accelerators. Nature Photonics, 7(10):775, 
2013. 

E. Adli, A Ahuja, O Apsimon, R Apsimon, A-M Bachmann, D Barrientos, F Batsch, J Bauche, 
VK Berglyd Olsen, M Bernardini, et al. Acceleration of electrons in the plasma wakefield 
of a proton bunch. Nature, 561(7723):363-367, 2018. 

. A. J. Gonsalves, K. Nakamura, J. Daniels, C. Benedetti, C. Pieronek, T. C. H. de Raadt, 

S. Steinke, J. H. Bin, S. S. Bulanov, J. van Tilborg, C. G. R. Geddes, C. B. Schroeder, 

Cs. Tth, E. Esarey, K. Swanson, L. Fan-Chiang, G. Bagdasarov, N. Bobrova, V. Gasilov, 

G. Korn, P. Sasorov, and W. P. Leemans. Petawatt laser guiding and electron beam 

acceleration to 8 GeV in a laser-heated capillary discharge waveguide. Phys. Rev. Lett., 

122:084801, Feb 2019. 


Magnets for Beam Control and 
Manipulation 


4.1 The Family of Standard Magnetic Field Profiles. 112 
Case n = 1: Dipole * Case n = 2: Quadrupole * Case 
n = 3: Sextupole * Case n = 1,2: Gradient Dipole 

4.2 Generating an Arbitrary Magnetic Field Shape .. 117 


ta Mapiet Multiples visa nis deseecanquapaneagaadelnaspes 118 
AA. Eleeromaanets i: pccitorcieadetdseardde doe eobeagariid 120 
Practicalities of DC Magnets * Practicalities of AC 
Magnets 
AS Permanent Magnets us iccitesseeeiea viernes oieees 138 
4.6 Superconducting Magnets................... cess eee 149 


Superconducting Materials * Coil-Dominated Magnets 

e SC Undulators 
OTSA 45a se beGdb cu E TE EEEE AE concep Saabs 158 
RET o EE E Paes irks snl id Neb dam nee ees 159 


An enormous strength associated with particle accelerators is the ability we have to steer, 
focus, and otherwise manipulate the charged particle beams. This enables us to create 
accelerators with a circular geometry so the particles continuously and stably pass around 
the machine time and again or to generate very tightly focused beams down to the nanometre 
level, for example. Our ability to steer and focus particles has some similarities to using 
mirrors and lenses in conventional optics. One limitation of optics which is often overlooked 
however, is that they rely on the material properties of the item itself, the consequence of 
this being that a lens, for example, will only properly function over a restricted part of the 
electromagnetic spectrum. So, you can’t focus X-rays with a lens that focuses visible light. 
Since we manipulate charged particles with magnetic fields rather than relying upon specific 
materials, we do not have this limitation — any charged particle of any energy is effected in an 
entirely predictable and repeatable way. There are no particle energies which are ‘off-limits’ 
because Nature hasn’t provided a material or coating with the right properties! This chapter 
will explain how the standard magnetic field distributions of dipole, quadrupole, and so on 
can be generated with high quality in the real world using coils and steel poles. It will also 
consider many of the practicalities involved in designing and manufacturing highly reliable 
magnets, either static or time-varying. Finally, the application of the alternative magnet 
technologies of permanent magnets and superconducting magnets will also be covered. 
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4.1 The Family of Standard Magnetic Field Profiles 


The majority of magnetic field distributions that are used in particle accelerators are quite 
simple. The first one is the uniform, constant field which provides a bending force to the 
beam, making it take a circular path. The second one is where the field on the horizontal 
axis increases linearly with horizontal position x and passes through the origin. This applies 
a force to the beam that depends on its distance from the axis. If the beam is on-axis 
then it sees no field and passes straight through, but if it is off-axis, then it feels a force 
which bends it towards the axis proportional to x, much like an optical lens. So, this linear 
field variation with x applies focusing to the beam. A third popular field shape is one 
where the field increases with x? on the horizontal axis, which is used to correct focusing 
aberrations due to the beam of particles not all having exactly the same momentum. It 
turns out, as we shall see, that to make a pure constant field requires a two-pole magnet, 
called a dipole. One which varies with x requires four poles, and so is called a quadrupole, 
and the one which varies with x? requires six poles, and so is called a sextupole. Clearly 
there is a very simple pattern emerging here for these pure, ideal, field shapes in terms 
of the power of the field variation with x and the number of poles required to generate 
such a field. Hopefully, it is now clear why the term ‘multipoles’ is used in the accelerator 
community when discussing magnetic fields and their impact on the beams. Each multipole 
(dipole, quadrupole, sextupole, etc.) actually represents an independent term on the infinite 
polynomial series B,,x” as we shall discuss in more detail later. 

A nice feature of magnets is that these different, pure, field shapes can be added to- 
gether to make a more complex field pattern, if the beam requires it, with the ideal pole 
arrangement being readily determined. An example of this is when a combined focusing and 
bending field is required. In this case the ideal field varies linearly with x but is non-zero 
at the origin so even the beam which passes through the centre of the field feels an over- 
all bending force. This field shape, called a gradient dipole or combined function dipole, 
along with the others mentioned above, are sketched out schematically in Fig 4.1. How the 
pole shape and number of poles is determined by the field shape required will now be ex- 
plored, closely following the approach described by Tanabe [1], which provides more detail 
if required. 

We start from the two Maxwell equations which are relevant to static (i.e. do not vary 
with time) magnetic fields and also make the further assumption that there are no current 
sources. Since the charged particle beams pass through the gap between the magnet poles, 
well away from current-carrying conductors, this is a good assumption in general: 


V-B =0, (4.1) 
VxB =0. (4.2) 


Next we introduce the vector potential, A, and scalar potential, V. These two potentials are 
commonly used in vector calculus to develop an understanding of the field being analysed. 
In our case it turns out that the vector potential maps out the lines of flux and the scalar 
potential maps out the family of ideal pole shapes required for a particular magnetic field. 
Either of these potentials can be used to determine the magnetic field since, due to standard 
results from vector calculus, we can also write 


B = VxA, (4.3) 
B = -VV. (4.4) 


Both A and V satisfy the Laplace equation 


VA=V°V =0, 
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(a) (b) 
B constant with x (dipole) B linear with x (quadrupole) 
B B 
x x 
(c) (d) 
B quadratic with x (sextupole) B linear with x, non-zero at origin 
(gradient dipole) 
B B 
x x 


FIGURE 4.1 Common magnetic field distributions used by accelerators and their associated name or 
pole configuration. 


which means that the complex function, F = A + iV also satisfies the Laplace equation. If 
we now constrain ourselves to working in two dimensions, then we can find the potentials 
which satisfy the Maxwell equations above. First, we note that any analytic function of 
the complex variable z = x + iy also satisfies the Laplace equation and so we can use a 
convenient function C,,z” = A +iV to help us find the vector and scalar equipotentials (i.e. 
contours of a particular constant value) which will map out the lines of flux and possible 
pole shapes for some standard magnet types. The potentials will also enable us to calculate 
the magnetic field according to the equations 


OA oV 


OA OV 
By = ag = OG (4.6) 


Note that the magnetic field is given by the gradient of the potential. This tallies with our 
understanding that when flux lines are densely packed together the fields are highest. 


4.1.1 Case n=1: Dipole 


In the general case we can write 


5S Cyz” =A +iV, 
n=1 
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FIGURE 4.2 The scalar equipotential for a uniform magnetic field is a horizontal steel pole surface. 
The vector equipotentials map out the flux lines; as they are equally spaced, the magnetic field must be 
perfectly uniform. 


but for now we will restrict ourselves to the simplest case of n = 1 only. In this case then 
Cız = Cı (x + iy) = A + iV. (4.7) 


If Cı is real we can gather the real and imaginary terms and see that the potentials are 
given by 
A =Cia, (4.8) 


Differentiating, according to the equations above, to find what value of B, and By these 
potentials represent gives us 


Bs = ond (4.10) 
Oy 
OA 

By, = a TO (4.11) 


And so the case n = 1 gives us a constant magnetic field in the vertical plane. The equipo- 
tentials for this case are plotted in Fig 4.2 and, as expected for a perfect vertical field, the 
vector potential maps out the equally-spaced vertical flux lines, which are orthogonal to the 
lines of scalar potential (this is always true in fact). These scalar potential lines define the 
perfect steel pole surface that will generate these magnetic fields. In this case a pair of hor- 
izontal, parallel, steel poles (a dipole) equally spaced about the horizontal axis, extending 
out to infinity in +x are required. One pole is determined by V and the opposite one by 
—V. Note that each and every scalar equipotential line represents a possible pole surface; 
there is not just one unique position for the poles, there is a whole family of poles which 
will create this ideal field. The magnet designer can choose the optimum pair of poles which 
meet the physical and magnetic requirements for that particular application. 


4.1.2 Case n= 2: Quadrupole 
For this example we have that 


Coz? = Calz + iy)? =A+ivV, 
Co(a? + 2izy —y”?) =A+iV. 
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FIGURE 4.3 The equipotentials for a quadrupole which generates a field that is linear with position 
from the origin (see Fig 4.1 (b)). The vector equipotentials (shown with arrows) map out the flux lines. The 
field is zero at the centre of the magnet. 


Gathering the real and imaginary terms gives us 


A = Q- y’), (4.12) 
V 2CoTy. (4.13) 


Differentiating the vector potential, as before, then gives us the magnetic fields 


By = —2C2y, (4.14) 
By = —2Cgz. (4.15) 


So, the vertical magnetic field on the horizontal axis is linear with x, and zero at the 
origin, as required for a focusing magnet. The field along the vertical axis is horizontal and 
linear with y (with the same coefficient as B,) and so is also providing a focusing effect. 
Unfortunately, due to By and By, having the same sign in the equations above, one axis will 
focus the beam towards the origin whilst the other axis will defocus the beam away from 
the origin. This well-known concept that a quadrupole focuses in one plane and defocuses 
in the other is fundamental, as we can now see. The fields must obey Maxwell’s equations 
and this is a direct consequence of that requirement. 

The equipotentials for this case are plotted in Fig 4.3 with both A and V mapping 
out rectangular hyperbolas (which means the asymptotes are perpendicular to each other). 
Again, we can see from the lines of vector equipotential that they become more densely 
packed away from the origin, indicating the field strength increase. The scalar equipotentials 
map out ideal steel pole surfaces, which in this case extend to infinity along both the x and 
y axes. There must be four poles — one per quadrant — and hence this is called a quadrupole. 
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FIGURE 4.4 The equipotentials for a sextupole which generates a field that is quadratic with position 
from the origin (see Fig 4.1 (c)). The vector equipotentials, shown with arrows, map out the flux lines. The 
field is zero at the centre of the magnet. 


4.1.3 Case n= 3: Sextupole 
For this example we have that 
C32? = Ca(x+ iy)? =A+iV, 
C3(x° — 3zy? + 3ixz?°y — iy?) = A+iV. 


Following the same procedure as before gives us 


A =C3(z? — 32y"), (4.16) 
V =C3(3x?y — y’). (4.17) 


Differentiating the vector potential, as before, then gives us the magnetic fields 


B, = —6C3xy, (4.18) 
B; —30327 + 3C34?. (4.19) 


Now we can see that the vertical field on the x axis is quadratic in x and zero at the 
origin. The equipotentials for this case are plotted in Fig 4.4. The scalar equipotentials 
have asymptotes at 0°, 60°, 120°, ... and so there are six poles required for this field shape 
— hence the term sextupole. 


4.1.4 Case n=1,2: Gradient Dipole 


As mentioned earlier, a common magnet which combines two multipole types is the gradient 
(or combined-function) dipole, which has a non-zero field on axis and has a field varying 
linearly with x; see Fig 4.1 (d). As we know that we want a combination of dipole and 
quadrupole, we follow the same procedure as before but this time include the terms for 
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FIGURE 4.5 The equipotentials for a gradient dipole which generates a field that is non-zero at the 
origin and linear with position (see Fig 4.1 (d)). The vector equipotentials, shown with arrows, map out the 
flux lines. 


both n = 1 and n = 2: 


Cız + Caz? = Cr(a + iy) + Co(a + iy)? =A+t+ iV, 
Cix +iCiy + Calz? + 2izy—- y) = A+iV. 


Gathering the real and imaginary terms gives us 


A = Cx = Cox? = Coy’, (4.20) 
V = Cyt 2Cory. (4.21) 


Differentiating the vector potential then gives us the magnetic field 


So, the field varies as required and the ideal pole shape is found by plotting lines of constant 
V for the required values of Cy and Cg, see Fig 4.5. This particular magnet can also be 
considered to be a simple quadrupole, but with the beam axis offset from the physical centre 
of the quadrupole so that the field at the origin is non-zero. If one applies a geometric shift 
of the origin along the z-axis to a normal quadrupole description then exactly the same 
pole shape equation is found. 


4.2 Generating an Arbitrary Magnetic Field Shape 


For the standard magnet types the pole shapes are well known, with several examples being 
given in the previous section. So, when asked to design a quadrupole, for example, the 
ideal pole shape is already known to be a hyperbola and the designer must optimise the 
magnet for maximum efficiency, which normally means minimizing the magnet aperture. 
They must also choose how to approximate the pole shape to the ideal, which extends to 
infinity, given a field quality specification over a particular physical region which is required 
by the accelerator. Such choices are important, to make sure a magnet performs as expected 
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and will be covered later in this chapter, but not particularly challenging from a physics 
perspective. A more challenging problem for a magnet designer is to be given a magnetic 
field profile (i.e. B, as a function of x), that does not correspond to a well-known type, and 
to design a magnet that will generate the required fields. The first step in this case is to fit 
the field profile to a polynomial of the form: 


B, = Bı + Bx + Bg? + ..., (4.23) 


where B, represents the dB/dx (quadrupole) term and BY represents the d?B/dz? (sex- 
tupole) term and so on. Then we can write 


OV Haaa 


B= i = Bı + Bha + B}x +... 
V = = (B1 + Bge + Bf? + .)dy 
V = -(B, + Bz + Bgx? +...)y 
y = 4 . (4.24) 


Bı + Bhat+ Bua? +... 


A line of constant scalar potential will then define the ideal pole shape that will generate 
the required field. The optimal value of V is normally the one which minimises the magnet 
aperture, and hence the required Ampere-turns in the coils, within the physical boundary 
conditions set by the other factors at play, such as the beam aperture requirements or 
achieving a particular vacuum level. Later in this chapter we will look at how these ideal 
magnetic fields and pole shapes, which extend to infinity, are dealt with in real-life situations. 
The skill of the engineer or magnet designer is to generate the magnetic field of the correct 
shape and of sufficient quality in the region where it is required by the beam in as efficient 
a manner as possible. Here, efficiency normally equates to cost to build and cost to operate. 


4.3 Magnet Multipoles 


We have already noted that we describe different magnetic field distributions in terms of 
‘multipoles’, with examples being pure dipole, pure quadrupole, and so on. In this section 
we will define multipoles more formally and explain how we use them to specify and judge 
the quality of a magnetic field. In general, all physical distributions of magnetic field in two 
dimensions in a region free from steel and coils can be described by an infinite sum of all 
multipoles [2]. 


oo 
X O 


n=1 


Yo Cale + iy)". (4.25) 


n=1 


By +iB, 


A pure multipole has Cn 4 0 for just one term in the series (n = 1 is a dipole, n = 2 isa 
quadrupole, etc). We also note that Cn is a complex constant so 
oo 
By +iBs = X_ (Jn + iKn)(£ + ty)". (4.26) 
n=1 
The coefficients Jn and Kn characterise the strength and orientation of each multipole 
component. The units of these coefficients are different for every value of n (e.g. Jı is in 
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T, Jo is in T/m, J3 is in T/m?) which can get cumbersome. A common approach is to 
normalise the coefficients so they become dimensionless. This is achieved by multiplying 
the expression by a reference field Bef and dividing by a reference radius R,¢ raised to 
the power n. Note that Bres is the actual magnitude (in T) of the main field (i.e. the actual 
multipole we are interested in) measured at the position Rref. The actual choice of what 
Rreg to select is arbitrary but should be stated, with a typical value being 2/3 of the magnet 
inner radius since this is often a good approximation to the full extent of the beam within 
the magnet. We can now write 


[oe] . n-1 
By +iBs = Bret X (int ikn) (= 2) . (4.27) 
ref 


n=1 


So, for the pure vertical dipole case, where B,.¢ = Jı (in T) we can see that jı = 1 (dimen- 
sionless). Similarly, for the quadrupole, with Bref = J2Rref, we can see that jo = 1. This 
clearly demonstrates that we have normalised the multipole expansion. For a good-quality 
magnet, the other coefficients would be expected to be <0.01% of the main component and 
so to make discussion and comparison of different magnets a little easier, it is also common 
to multiply the expansion again by the constant 1074 so that the main component has the 
value 10,000 and the other components have values of around unity. Magnet designers will 
(confusingly!) talk about how many ‘units’ of a particular multipole are present in their 
magnet, and it is this further normalised case that they are referring to. 

In the accelerator environment there are two orientations of multipole field that are 
utilised. The first is called normal and the second is called skew. The normal cases are 
those where the magnetic field is vertical on the horizontal axis and, in fact, all of the cases 
considered earlier were of this type (see Figs. 4.2 to 4.4). The skew cases are those where 
the magnetic field is horizontal on the horizontal axis. We can see from the figures that 
skew magnets are simply normal magnets that are rotated by 7/2n about the axis. More 
formally, the normal cases are characterised by Cn being real and the skew cases when Cn 
is imaginary. In other words, the J, terms represent the normal multipoles and the Kn 
terms the skew multipoles. It is easy to see that if we repeat the n = 1 example from earlier 
(Section 4.1), but this time assuming that C; is imaginary, we will find that the magnetic 
field is still a perfect dipole but that it is now oriented in the horizontal plane (i.e. it is a 
skew dipole). 


Field Errors 


Of course, a perfect multipole only contains one term in the multipole expansion and as 
such contains no field errors. Unfortunately, such magnets require infinitely wide steel poles 
or equally as unrealistic current density distributions (if we choose not to use any steel, 
as we shall see later in Section 4.6). In practice, we must design a magnet which is an 
excellent approximation to the ideal, which in this case means the steel poles have a finite 
extent. This unavoidably introduces systematic field errors even if we then build our design 
with no physical imperfections. This type of error is sometimes called an allowed error. The 
possible multipole terms which can generate these allowed errors are limited by symmetry 
and polarity [1] to those which generate a field in the same direction if they are rotated by 
m/n and have their polarity reversed. So in the n = 1 case, if we rotate the dipole by 180° 
and reverse the polarity, the field direction is the same, but if we rotate a quadrupole by 180° 
and reverse the polarity, then the field direction is misaligned. If we try the same rotate- 
and-reverse process on the sextupole, then it is aligned and so a dipole magnet will contain 
a sextupole error term. The formal generalisation of this is that the allowed multipoles are 
those that satisfy 

Nallowed = n(2m + 1), (4.28) 
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where m = 0,1,2,... . Clearly m = 0 corresponds to the multipole we are trying to generate 
but the subsequent ones are all error terms. So a dipole magnet will include multipole errors 
of sextupole, decapole, 14-pole, etc. A quadrupole will contain errors due to 12-pole, 20-pole, 
and so on. 

Manufacturing tolerances mean that the magnets are not perfect and that symmetries 
will be broken. This implies that a real magnet can and will have non-zero values for all the 
Jn and Kpn coefficients. In fact, certain multipole errors can point to particular fabrication 
errors in terms of what symmetry has been broken [3]. 

A common method of specifying the field quality of any particular magnet is to put 
absolute limits on each of the multipole terms up to a sufficiently high order. The limits 
should be determined by thorough beam dynamics simulations as should the determination 
for which orders are critical. An alternative approach is to define the field quality in terms 
of how the absolute field level is allowed to vary within the good field region. For example, 
a dipole magnet may be specified to have a maximum field variation locally (i.e. at some 
specific longitudinal position within the magnet) of up to 0.01% of the main field, or a 
quadrupole might be specified to have a maximum gradient deviation locally of up to 0.01%. 
This alternative method puts absolute limits on the magnetic field performance but makes 
no comment on the actual multipole content. It is also important that integrated field levels 
and quality are specified in both cases. This means that the field quality should be judged 
through the length of the magnet (the fields are integrated in the beam direction) since this 
is what the beam will do! A high-quality dipole in a storage ring would be expected to have 
an integrated field variation of better than 0.01%, but in a single pass accelerator a level 
of 0.1% may well be sufficient. Similar numbers apply to integrated quadrupole gradient 
errors. 


4.4 Electromagnets 


The magnet type of choice for most accelerator applications is one based upon the use of 
current-carrying resistive (or normal conducting) coils. The alternative technologies which 
are based upon superconducting coils or permanent magnets have very important appli- 
cations in accelerators, and will be discussed later, but they are generally employed when 
the standard electromagnet is unable to meet the required needs of the accelerator. The 
electromagnet is popular because they are well understood, relatively straightforward to 
design and build, are extremely reliable, available from industry, and easily adjusted by 
simply changing the current in a coil. In this section we will look at three different types of 
electromagnet; DC, AC, and pulsed. The first is DC (direct current) which means that the 
current is held constant and so the field is static. Of course, this does not mean that the field 
cannot be changed by altering the current, just that a static field is required by the accel- 
erator. This type is used, for example, in a storage ring or transfer line which operates at a 
fixed beam energy day after day. The second type is AC (alternating current) which means 
the current has a time-varying, periodic, waveform. This is used to generate time-varying, 
periodic, magnetic fields as is required in a synchrotron, for example. Strictly speaking, 
AC implies that the current reverses direction in the circuit but this is often not the case 
in accelerator magnets where AC is used as shorthand to indicate that the magnetic field 
is periodically varying with time between a minimum and a maximum value, often with 
the same polarity. Also, the current waveform is typically not sinusoidal. The waveform is 
determined by the needs of the accelerator within the limitations of the magnet and power 
supply circuit. The third type of electromagnet is the pulsed magnet. These are magnets 
which are energised by a current pulse as and when required, and are off the remainder of 
the time. This type of magnet might be used to capture a beam injected into a storage ring 
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FIGURE 4.6 Cross section through an H-dipole, which is used to generate a uniform magnetic field. 
The electron beam is travelling into the page at the centre of the magnet, in the region between the two 
poles. The upper and lower poles each have a coil encircling the pole. The cross and dot within the coil 
cross section denotes the current flowing into and out of the page. 


or to rapidly discard (or dump) a beam as part of a machine protection system, for example. 
In this section we will consider each of these three electromagnet types in practical terms 
and discuss the issues which should be taken into account when designing and building 
them. 


4.4.1 Practicalities of DC Magnets 


From the earlier section looking at the families of standard accelerator magnets we now 
have a theoretical understanding of the ideal pole shapes for each type. We will next look 
at how these theoretical pole shape curves are turned into real devices, starting with the 
example of a DC dipole. 


Dipoles 


For a perfect uniform magnetic field we found in Section 4.1.1 that we need a pair of 
infinitely wide horizontal, parallel, steel poles. We approximate these with a pair of finite- 
width, parallel poles which are energised by a coil wrapped around each pole. To complete 
the magnetic circuit efficiently, we need to connect the two poles with steel, away from the 
region of interest where the particle beam will travel, since steel has a much higher relative 
permeability than air. A popular dipole design is the so-called H-type, illustrated in Fig 4.6, 
because it is symmetric, supported mechanically on both sides, and has coils which are a 
simple shape to wind. The name simply comes from the H shape that the steel parts define 
in the central air region. 

To calculate how the current flowing in the coils relates to the magnetic field at the 
centre of the dipole we need to refer back to another of Maxwell’s equations 


OD 
V x B= uruoJ + rome (4.29) 


122 The Science and Technology of Particle Accelerators 


where upr is the relative permeability, 4o is the permeability of free space, J is the electric 
current density, and D is the electric displacement field. In our case D is unchanging with 
time (and is zero) and so can be neglected. We therefore have 


V x B = uppod. (4.30) 
Applying Stokes’s theorem from vector calculus, which states that if F is a smooth vector 
field, then 
[vxPeas= F-a, (4.31) 
S P 
where P is a closed path that is the boundary of the surface S, we obtain 
B 
-d= | J-dS= NI. (4.32) 
P Hro S 


The integral of the current density over the surface is simply the current flowing through 
the surface, which is conventionally written for a magnet coil as NI to represent a coil 
with N turns of wire carrying a current J. It is very common to talk about the number of 
Ampere-turns provided by a coil; this is simply shorthand for the product NT. 

So, now we can see that if we know what magnetic field we want in our system then by 
integrating this field along a closed path — of our choosing — we can determine what current 
will be needed to develop this field. Returning to the H dipole of Fig 4.7, the closed path 
has been chosen to be made up of three parts. Path 1 is in the air region (ur = 1), starting 
from the centre to the pole tip (a distance g/2, where g is the full magnet gap), and the 
field is uniform with value Bo, path 2 is in the steel where the magnetic field will be of 
similar magnitude to Bo but where ur will be very large, and path 3 completes the loop 
through both steel and air but at the midplane of the dipole where B is always orthogonal 
to the horizontal axis and so the dot product will be zero. So, in the limit where the relative 
permeability of the steel tends towards infinity, we have that 


NI = | p -dl + B -dl + p - dl, 
P1 LrHo P2 HrHo P3 HrHo 
gBo 
NI = =—. 4.33 
TA (4.33) 


Remember that this value of NI is for the top coil (enclosed by our selected path); there 
will also need to be the same number of Ampere-turns in the bottom coil to generate Bo 
across the full magnet gap. An interesting point to note is that, in the H dipole configuration, 
winding the coils around the return yoke, or back leg (as shown in Fig 4.8) is not an efficient 
solution. One might assume that using 4 coils of NJ Ampere-turns instead of two would 
double the field at the centre of the dipole, but in fact it just generates the same field as 
before since the line integral of B bounds the same NIJ as the case where the coil is wound 
around the pole. In fact, the coils around the back leg act to generate field outside and away 
from the dipole, in the region where it is not required! A second point to note is that in 
this idealised case, where the relative permeability of the steel tends towards infinity, the 
pole width does not appear in the equation. So, no matter how wide the pole is, the same 
magnetic field will be generated in the air gap between the poles, at least in the central 
region. Also, the integral of B,/, in the air region along any vertical path parallel to path 1 
is a constant. In the central region of the magnet, the field is uniform in y but towards the 
sides where the pole terminates, the vertical field in the plane of the magnet (the x axis) 
begins to fall away. We can conclude from this that, since the integral is constant, the 
vertical field, B,, must therefore increase with y, as we get closer to the steel surface to 
compensate. This is indeed the case; B, increases in the region near to the pole corner. 
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FIGURE 4.7 Cross section through an H-dipole showing the closed integration path chosen to calculate 
the current needed to generate a particular field value. 
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FIGURE 4.8 Cross section through an H-dipole where four coils are wound around the back leg instead 
of two around the pole. This generates the same field at the magnet centre as the two-coil version and not 
double the field as one might intuitively expect. 
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FIGURE 4.9 Close-up of dipole pole region illustrating the use of small pole shape adjustments, or 
shims, to counteract the B field decay towards the pole edges. The shims can be made of separate steel 


pieces fastened to the pole or, more typically, be integral to the pole itself. 


Returning to our practical DC dipole, the accelerator designer will define a region where 
the magnetic field must meet some particular field quality specification. This region is often 
referred to as the good field region of the magnet. The size, shape, and quality of the field 
in this region is usually determined by extensive beam dynamics simulations to understand 
what part of the magnetic field the charged particles could pass through under a number 
of scenarios and how they will behave in an imperfect (i.e. a realistic) field. The magnetic 
field outside of the good field region is of no relevance to the beam since it should never 
encounter this part of the field. So, the magnet designer will optimise the magnet to achieve 
the required specification in the good field region and no more. In a dipole a particular 
absolute field level and uniformity is typically specified over a certain region. The magnet 
designer will choose a pole width which just achieves this. In our simple H dipole design, the 
vertical magnetic field is constant in the central region between the poles but towards the 
edges, where the pole is terminated to allow space for the coil, the field starts to decrease. By 
making some small reduction to the pole gap in this part of the magnet, this intrinsic field 
decay can be counterbalanced, and so the useful field of the dipole is extended horizontally 
with a very simple change. This minor change to the pole shape towards the pole corner is 
called shimming and is illustrated in Fig 4.9. The alternative to including this extra steel 
at the pole extremities would be to simply make the pole a little wider. This would be 
perfectly acceptable but just more expensive. As a general rule, the cost of a magnet scales 
with its mass so more material means greater expense. 

So far we have only considered ideal steel performance with extremely large relative 
permeability. This is a good approximation at low fields where a relative permeability of 
several thousand is common, but ur is not a constant with B and tends towards 1 at very 
high fields. An example graph of relative permeability for a good-quality, common, magnet 
steel is shown in Fig 4.10. For this example ur > 1000 until around 1.4 T, and by 2 T 
it is around 50. This means that the approximation used earlier to calculate NJ at such 
high fields will no longer hold and additional Ampere-turns will be necessary to achieve the 
required field in air because the line integral along path 2 is no longer negligible. Calculating 
the impact of this non-linear behaviour of ur is not possible analytically; instead there are 
several magnet modelling software tools which can be used to calculate the fields numerically, 
either in two or three dimensions. 

An alternative to the H dipole is the C dipole, which is illustrated in Fig 4.11. The name 
here reflects the C shape of the steel yoke. It is effectively one half of an H-type magnet. 
This design is less rigid mechanically because it is only supported on one side, but access 
from the side is now possible, which makes magnetic measurements easier and can also be 
important for certain applications; these include synchrotron light sources where beams of 
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FIGURE 4.10 Graph showing the relative permeability as a function of magnetic field for a good-quality, 
low-carbon, commonly used magnet steel (XC10/AISI 1010). 


X-rays from upstream undulators, or the dipole itself, would otherwise intercept the steel 
back leg as they exit from the accelerator. 


Quadrupoles 


Next we will consider the DC quadrupole, starting with the pole shape determined in 
Section 4.1.2 and visualised in Fig 4.3. The ideal steel pole surface extends out to infinity 
horizontally and vertically and adjacent poles approach each other. The consequence of this 
is that, at face value, there seems little prospect of finding the space to wrap a coil around 
each pole. Fortunately, since we are only interested in generating high-quality magnetic fields 
in the good field region, we can choose to terminate the pole asymptote at an appropriate 
position and then shape the steel away from this region so that space for coils is created 
along with a return path for the magnetic flux within the steel. An example of a standard 
quadrupole cross section is given in Fig 4.12. To determine the Ampere-turns required to 
achieve a particular quadrupole field gradient, we must follow a similar integral along a 
closed path calculation to that used earlier for the dipole. Path 1 is in the air region where 
the field at the centre of the magnet is zero and it increases linearly, reaching Bo at the 
pole tip, which is at a radius ro from the magnet centre (so the field integral along path 1 
is roBo/2). Note that the field direction is radial only (i.e. co-linear with Path 1). Path 2 is 
within the steel region, which has very large relative permeability, and path 3 completes the 
loop through both steel and air, but at the midplane of the quadrupole where B is always 
orthogonal to the horizontal axis and so the dot product will be zero. 
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FIGURE 4.11 Cross section through a C-dipole, which is used to generate a uniform magnetic field. 
The electron beam is travelling into the page in the region between the two poles. The upper and lower 
poles each have a coil encircling the pole. The cross and dot within the coil cross section denote the current 
flowing into and out of the page. 


FIGURE 4.12 Cross section through a quadrupole. The electron beam is travelling into the page in the 
central region between the four poles. The cross and dot within the coil cross section denote the current 
flowing into and out of the page. 
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FIGURE 4.13 Close-up of quadrupole pole region illustrating the use of small pole shape adjustments, 
or shims, to counteract the B field decay towards the pole corners. The shims are typically integral to the 
pole itself and are tangent to the pole asymptotic shape but this is not essential. 


NI = | B a+ f B -dl + n - dl, 
P1 LrHo P2 HrHo P3 HrHo 
roBo 
NI = . 4.34 
Dpto (4.34) 


The field gradient generated by the quadrupole, with NJ Ampere-turns per coil, is therefore 


dB, S Bo = 2uoNTI 


4.35 
dx ro a ey) 
This analysis is effectively the same as for the dipole and it can similarly be further extended 
for higher-order magnets as required. For example, in the case of a sextupole, which is 
parameterised by the second field derivative, we find that 


By _ 6poNI 
dx? r3 


l (4.36) 


As for the dipole case, the magnet designer has the option of making adjustments to the 
quadrupole steel pole shape at the extremities to counteract the natural field roll off due to 
the pole corner. A small amount of extra steel is included at the pole corner, to bring the 
field back up to the required level over a short distance. This is a cost-effective change that 
can readily be made to any design, maximising the good field region for a particular steel 
pole width. Fig 4.13 illustrates how this shim is often included as a simple tangent to the 
hyperbolic pole shape. 
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Coils 


The conductor which is wound to form the coils is typically made of copper, although 
aluminium is also sometimes used. Even though copper is a very good electrical conductor, 
each coil will have a finite resistance and so ohmic heating is an important consideration for 
electromagnets. To achieve a particular field level, we need to provide sufficient Ampere- 
turns, NI, via the coils. The choice of how many turns, N, to have in each coil obviously 
dictates the current that the conductor has to carry. A very large number of turns means 
that a relatively low current is needed, whereas very few turns in the coil means that a 
relatively large current must be supplied. The designer must select N and J so that the coil 
is practical to build and operate but should bear in mind that there are a range of solutions 
possible, to some extent this is somewhat of an arbitrary choice. A practical coil is one 
which can operate at full current continuously without breaking down or overheating. To 
prevent overheating, even of a well-designed coil, some form of cooling is generally required. 
For coils which consume low power, a heat sink might be adequate but more often than 
not active water cooling is required. For moderate power this can be indirect cooling, which 
generally means a water-cooled surface (e.g. a copper plate) is thermally attached to the 
coil, but for higher powers, direct water cooling is required to extract the heat. Direct 
cooling means that water flows through the copper conductor itself by using a cross section 
of conductor with a central hole for the water. Effectively the conductor is a thick-walled 
tube wound to make the coil. The key feature of this direct cooling is that the water 
flows directly through every turn in the coil and so the inner windings of the coil — which 
are the hottest and so most vulnerable to thermal failure as they are fully surrounded by 
other turns and so have no surface exposed to air — also receive the direct benefit of the 
cooling water. For this reason directly-cooled coils can tolerate higher currents than those 
cooled passively or indirectly. Strictly speaking, it is a higher current density within the 
conductor which can be tolerated by directly-cooled coils since the electrical resistance of 
the conductor is proportional to the conductor cross-sectional area. As a rule of thumb, if 
the current density is below 1 A/mm? then air convection cooling is sufficient; indirect water 
cooling can be used up to 2 or 3 A/mm?, and direct water cooling used above this value. 
There is no clear limit on current density for direct-cooled coils although staying below 10 to 
15 A/mm? is often quoted as good practice. However, all of these numbers are just provided 
for guidance and examples can be found where low current densities do need cooling, and 
vice versa. The magnet designer must consider the thermal performance of every coil to 
satisfy themselves that it will operate safely and reliably and to define the water-cooling 
requirements, such as inlet temperature, outlet temperature, flow rate, and pressure. Again, 
there is no absolute maximum value for the water temperature rise which can be tolerated, 
but water temperature increases between the inlet and the outlet of between 10 and 20°C 
are typical. 

Since the cooling water flows directly through the current-carrying conductor, it must 
have low conductivity to prevent unwanted breakdowns. Demineralised (sometimes called 
de-ionised) water, which has had almost all of its mineral ions removed, is usually employed 
with a resistivity of around 5 MQ cm. This water can be rather corrosive, leaching out mate- 
rial from the coils and piping, and so care must be taken with the correct use of compatible 
materials and avoiding particular combinations of materials at joints, for instance [4]. 


Steel Yoke 


The steel structure which forms the main body of the magnet is called the yoke. The steel is 
shaped to form a continuous, low magnetic reluctance, high permeability flux path except 
in the air gap region of the beam. Steel that is close to the beam air gap, deliberately 
shaped to create the required magnetic field shape, is called a pole and steel that joins the 
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poles to form this continuous path, away from the air gap, is called a back leg. In reality 
it is normal for the complete steel yoke to not be made from a single piece for practical 
engineering reasons. These reasons could be the feasibility of physically mounting the coils 
around a pole, or allowing the whole magnet to be split into two halves so that the beam 
vacuum chamber can be installed. Such joints in the steel yoke have a negligible impact 
on the field levels, assuming any potential air gap at the joint is minimised, but care must 
be taken to ensure that the mechanical quality of the yoke is maintained by making sure 
such joints can be made repeatedly without changing the physical shape of the yoke. This 
repeatability is often guaranteed, to within tight tolerances (e.g. to within ~20 to 30 um), 
by the use of pins or dowels. 

The steel pieces that form the yoke can be machined from solid steel or built up by 
stacking thin (~1 mm) steel sheets or laminations. When time-varying magnetic fields are 
required then laminations must be used because of eddy current effects, as we shall see later. 
However, for static magnetic fields, both solid and laminated options are feasible. Lamina- 
tions are shaped, using a stamping tool, to form the transverse cross section of the magnet 
and then stacked up in the longitudinal beam direction before being permanently joined 
together (often using gluing but also involving welding or mechanical fixtures sometimes) 
to form a single unit. The use of a stamping tool means that lamination-to-lamination 
shape repeatability is very good and also is a quick process so can be cost-effective. The 
tool or die, which has to be specially made for each magnet, is quite expensive though, 
so when small numbers of magnets are required, it is often more cost-effective to use solid 
steel yokes. The accuracy of laminations due to the stamping process also means that the 
required mechanical tolerances (~20 to 30 um for the pole surface) for long magnets can 
be achieved more readily since machining over long pieces of steel reduces the precision 
achievable. Another advantage of using laminations is that any variation in the magnetic 
properties from batch to batch from the steel manufacturer can be considered more readily 
by mixing up — or shuffling — the laminations from the various batches within each single 
yoke. Such steel property variation can otherwise lead to small but significant differences in 
performance from magnet to magnet. 


Longitudinal Issues 


This section has so far concentrated on designing transverse cross sections for dipoles and 
quadrupoles and then considerations for the coil requirements to achieve the required field 
levels. Establishing these transverse designs is the first crucial step for any magnet design. 
The next step is to consider the longitudinal design (i.e. along the length of the magnet, 
in the direction of the beam). The length requirement is set by the needs of the particle 
beam, what angle the dipole must bend over for example. In general, the cross section 
is held constant through the majority of the magnet with some modification at both the 
entrance and exit. As for the transverse pole shape case, where we avoid abrupt steps in 
the steel shape since sharp ‘corners’ can readily become highly saturated, the same is true 
longitudinally. It is standard for a dipole to have a smooth change or roll-off in magnet 
gap at the beam entrance and exit to lower this saturation effect. The transverse shims 
may well need to be adjusted in size to compensate. Overall the integrated dipole field (or 
specific integrated multipole terms) through the magnet must be kept within predetermined 
limits and the end terminations are an important contributor to these integrals that must 
be carefully assessed and minimised with an appropriate end design. A similar approach is 
also taken in quadrupoles, although in general terms, the impact and need for the roll-off 
becomes less critical at higher orders and straight angular cut-offs or chamfers are sometimes 
implemented as an acceptable approximation to a smoother profile. 
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Forces and Stored Energy 

By considering the inductance of, and so the stored energy in, a solenoid, it is relatively easy 

to show that the magnetic energy density in a system (energy per unit volume, dE/dV) is 
di aE = B? 
dV  dzdyds  2uour’ 


(4.37) 


so to calculate the total stored energy in any magnet, we need to integrate this equation 
over the full volume, including all the yoke and coils. This is not a trivial integral and 
accurate solutions are only possible by using numerical codes. In a simple approximation 
for a uniform dipole field, Bo, where the steel is not saturated and so ur is very large in 
the yoke, we can choose to ignore the stored energy within the steel and just calculate the 
stored energy in the air gap. In this case the approximate stored energy is 


B= BegA 

210 
where g is the gap between the poles and A is the area under a pole, and the product gA 
is simply the volume between the poles down the full length of the magnet. This can be 


useful to estimate the inductance L of a dipole, remembering that the energy stored by an 
inductor is LI*/2; 


(4.38) 


(4.39) 


Substituting the approximate formula for the full Ampere-turns of a dipole required to 
achieve the field By we get 
HoN 2A 
i 
So, the inductance of a dipole depends upon the number of turns in the coils as well as the 
physical extent of the field and the magnet gap, it does not depend upon the peak field or 
the current in the coil. 
If we use the standard result that the work done (energy) is force x distance, i.e. dE = 
F - dy, we can express the force exerted between the two poles as 


dE B? 
F= // qy dtds =u Sa (4.41) 


So, in a region of constant magnetic field, the force would be 


iE (4.40) 


_ BRA 
20 ` 


F (4.42) 


Again, in the real world, to calculate forces accurately we must use a numerical code, but 
the above equation is useful to check that the code is giving realistic values. We also need 
to consider the force on the coils. We know that magnetic fields exert forces on charges, 
since this is why we use magnets in accelerators in the first place! We know that the force 
F on a charge q moving with velocity v within the presence of a magnetic field B is 


F =qvxB. (4.43) 


If there are N charges per unit volume, the number in a small volume dV of the coil is 
NdV. The total magnetic force on the volume dV is simply the sum of the forces on the 
individual charges, so that 

dF = NdV(qv x B). (4.44) 
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If we remember that the current density, J, is the flow of current per unit area through the 
wire, then we can see that J = Nqv and so 


dF = (J x B)dV, (4.45) 


and the force on a coil is given by the integral 
F= J J x BdV. (4.46) 


It is clear that to calculate the force acting on a coil is not trivial, requiring an understand- 
ing of the magnetic field distribution within the coil, and so we rely on numerical codes 
once again. To understand the direction that the force is acting on the coil, we just need 
to apply the well-known right-hand rule. Consideration of the forces on conventional DC 
magnet coils is generally quite superficial, but this is far more important when working with 
superconducting magnets, where both J and B can be very high, as we shall see later in 
this chapter. 


Reliability Issues 


DC electromagnets can be extremely reliable components in a particle accelerator and some- 
times failures are so infrequent that we can become complacent. We have worked on several 
accelerators which have suffered no major magnet failures at all during their lifetime. Such 
high reliability is built on making good design and material choices, having thorough testing 
and acceptance criteria before installation, and routine maintenance schedules. Often when 
a magnet system causes accelerator downtime it does not mean there has been complete 
magnet failure, and instead it is likely to be a water leak, water blockage, or a poor electrical 
connection. It is relatively rare for a coil to fail and need replacing and very rare for a steel 
yoke to fail. A survey of accelerator magnet reliability [5] found that water leaks from hoses 
or fittings were the most common cause of problems. Non-conductive hoses must be used 
when hollow direct water-cooled coils are employed and these hoses are always made from 
organic molecules (thermoplastics or elastomers) which can be damaged with ionizing radia- 
tion, obviously a problem for high-energy particle accelerators. The hoses degrade with time 
and eventually will become brittle and crack. Regular replacement of hoses before they fail, 
at a rate determined by the level of radiation they are subject to, is highly recommended 
and easy to overlook. Water fittings are a rather mundane item for a particle accelerator 
and so it is possible to not pay enough attention to them. Our experience, born out by 
this survey, is that water fittings do fail or leak and it is worth buying the more expensive 
fittings (which are still cheap when compared to the magnets themselves!) to increase relia- 
bility. The second highest cause of failure found was water leakage at brazed joints. Brazed 
joints are very difficult to avoid completely (e.g. they are used to connect water fittings to 
the hollow conductor ends) but when we procure the magnet we can certainly insist that 
we don’t want them hidden away inside of the magnet coils. The best way to avoid this 
failure is to have a thorough coil and water fitting testing regime before the magnets are 
assembled. Should a braze fail, then the best fix is likely to be to change the coil, which 
means having a spare one available. So, when procuring magnets we always buy at least 
one spare coil of each type. Having said that, we have not had to exchange a coil for many 
many years, which probably reflects the high manufacturing standards routinely offered 
now by magnet suppliers, as well as the factory tests mentioned earlier. We continue to buy 
spare coils though just in case! The survey also noted that almost half of the accelerators 
surveyed suffered from water blockages in the cooling system every year. Such a blockage 
can be relatively simple to fix by flushing water through the system in the opposite flow 
direction. However, it is far better to carry out routine maintenance to prevent the buildup 
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of deposits which are causing the blockages. Of course, if water flow levels are dropping day 
by day then this is a warning that it is time to intervene before the coil becomes completely 
blocked. 

Other issues which were noted by the survey were that the electrical connections, which 
are typically bolted joints for the higher current magnets, can work loose over time or not 
be tightened sufficiently on installation. This is an issue which regular routine maintenance 
and inspection should find and resolve easily. Another issue to note, that we certainly have 
encountered in the past, is that the polarity of a magnet is easy to get wrong by incorrect 
electrical wiring to the magnet coils or at the power convertor. We carry out polarity checks 
on all magnets, using a simple hand-held Hall probe, whenever any electrical connections are 
touched. The time required to carry out these checks is much smaller than the time wasted 
in the control room later if a magnet is acting in the opposite mode to that expected! 


Specifying and Procuring of Magnets 


One of the most daunting things to face a newcomer to the field is being asked to procure 
a batch of magnets for a particular accelerator project. Designing a magnet with bespoke 
magnet software is one thing, but then converting your design into a real working magnet 
is another step up. The good news is that it appears far more daunting than it really is. 
The best place to start, if possible, is to talk to someone more experienced with magnet 
procurement, someone who has been through the process before. They can help by sharing 
experience, sharing specifications from previous procurement exercises, and by passing on 
relevant contacts in industry. 

There are really two alternative approaches that can be adopted for procurement with 
the difference being a question of who takes responsibility for the overall magnet perfor- 
mance; the procurer or the supplier. Both approaches can work perfectly well but it should 
be clear from the start which approach is being followed. Some of the larger accelerator 
laboratories will choose to take full responsibility for the magnet design, including the com- 
plete mechanical and electrical designs. They effectively produce a pack of drawings, lay 
out a strict production process, and define all materials to be used, and they then pass this 
fabrication pack to a magnet manufacturer and ask them to build exactly as drawn. This 
approach, sometimes called ‘build to print’, means that the procurer takes full responsibil- 
ity for the complete magnet design. The manufacturer is responsible for fabricating to the 
drawings and standards, but if the magnet does not perform as expected then, so long as 
they did what was asked, they will not be held responsible. The second approach is where 
the procurer specifies the magnet performance required and the other constraints, such as 
the space available and beam apertures, but offers no design at all to the supplier. The 
magnet supplier then takes full responsibility for all aspects of the magnet design and if 
the magnet does not perform to specification, it is the supplier’s responsibility to resolve. 
This second approach is more expensive at face value, because the supplier has to design 
the magnet as well as build it, but it does save the procurer significant design effort. We 
have used this second approach successfully for numerous DC magnet procurements for at 
least twenty years, even though we are perfectly capable of carrying out the full magnet 
design ourselves, because we prefer to pass responsibility onto the supplier and we appreci- 
ate that magnet companies are more expert than us in the mechanical and electrical design 
of magnets since they do this every day and we only need to do it occasionally. Prior to 
procurement of standard DC dipoles, quadrupoles, and so on, we carry out some simple 
magnet design simulations, often only in 2D, to confirm that we are requesting a feasible 
magnet and also to gain an appreciation of any particular challenges associated with the 
magnet. Then, we generate a detailed specification explaining exactly what magnetic per- 
formance is required; this specification will typically be about ten to twenty pages in length. 
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The specification covers all aspects of the magnet but crucially does not provide any design 
at all. It is quite detailed so that the manufacturer has all of the required information to 
design and build a magnet that is exactly fit for purpose. Of course, once the contract is in 
place, we have regular communication with the company to ensure the flow of information 
is two-way and that no wrong assumptions have been made by the manufacturer. We also 
insist on a full design review with the manufacturer prior to them starting to actually cut 
metal or wind coils. 

The typical contents of our procurement specification will provide a brief introduction 
to the project, a clear statement that the manufacturer is responsible for the complete 
magnetic, mechanical, electrical, and thermal design as well as the construction, testing, 
and all magnetic measurements. We accept magnets primarily on the basis of the magnetic 
measurements provided as well as mechanical, electrical and thermal checks. Within the 
specification some magnet parameters will be mandatory, such as dipole field level, field 
quality, and bend angle, whereas others will be nominal or even undefined to provide the 
manufacturer with scope to optimise the magnet in an efficient manner. Examples of nominal 
parameters might be the physical dimensions of the magnet, the number of turns in a coil, 
and the conductor cross section. The specification will also describe the mechanical interface 
to the accelerator, such as how the magnet is expected to be mounted onto whatever girder 
or stand is planned, and including the need for survey monuments and lifting brackets. We 
also specify on what side of the magnet we want the power and water connections to be 
placed, since this is important for the installed accelerator infrastructure. 

It is very often required that accelerator magnets can be physically split in half so that a 
vacuum vessel can be installed, and it is important that this need is requested and that the 
magnet performance is unaffected by it being split and reassembled. This means that mating 
and alignment features must be built into the steel yokes to ensure they can repeatably be 
reassembled without affecting their physical shape to quite tight tolerances. We specify 
the magnetic measurement facility performance that is required in order to ensure the 
manufacturer is capable of carrying out adequate measurements after manufacture of the 
magnets. 

For every magnet we provide a table of all of the essential parameters. For a dipole this 
would, as a minimum, include the type of dipole required (e.g. H or C, sector or parallel 
ended), the magnetic field, integrated magnetic field through the magnet, bend radius, 
the field uniformity locally and integrated through the magnet, the horizontal and vertical 
dimensions over which this field uniformity is required (the good field region), the minimum 
pole gap, and the physical space constraints (maximum width, height and length of the 
magnet). 

For a quadrupole this would, as a minimum, include the integrated gradient strength 
through the magnet, the allowed integrated gradient variation within the good field region, 
the size and shape of the good field region, and the physical constraints. We also specify 
some level of thermal performance, such as maximum temperature rise allowed in the cooling 
water (typically 10 to 20°C), but do not specify water flow rates or water channel dimensions, 
for example. With regards to the steel yokes, we allow the manufacturers to propose either 
solid or laminated yokes and also they can choose any suitable magnet steel above some 
minimum level of acceptability. 

For the coils, it is crucial that the insulation is adequate to prevent any electrical break- 
down between turns or between the coil and the yoke. It is common to request that the coils 
be insulated using fibre glass tape wound around the conductor as the coil is fabricated and 
for the full coil to then be mechanically consolidated with a radiation-resistant epoxy resin 
under vacuum impregnation to ensure full penetration within the coil. We do not allow any 
joints in the conductor within a single coil as this is a possible source of unreliability or 
failure, as mentioned earlier. We insist on a set of electrical and thermal tests for every coil 
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prior to magnet assembly. All the coils being thermally cycled several times to confirm the 
epoxy consolidation is mechanically robust. For the electrical tests we check the inter-turn 
insulation and insulation to ground. 

After magnet assembly, we insist on further electrical checks at high voltage between 
the coil terminals and the yoke to ensure there is no breakdown and also thermal tests at 
full operating current to ensure the maximum temperature rise is below the limit specified. 
In general, we insist on a full set of prescribed magnetic measurements for every magnet 
built, although in some circumstances we have accepted a detailed set of measurements 
for the first few magnets and a reduced set of measurements for subsequent ones where 
we are confident that the overall risk to the project is small. Our experience of magnet 
procurement using this approach, from several different European manufacturers, has been 
excellent. We have always received magnets that have performed as required and there have 
been no serious failures at all. 


4.4.2 Practicalities of AC Magnets 


We use the term AC magnets as shorthand to refer to periodically time-varying magnets, 
sometimes called cycling magnets. We understand that alternating current strictly refers 
to current that periodically reverses direction and is often associated with a sinusoidal 
waveform. However, to be clear, that is not what we mean when we use the term AC as 
many magnets are designed to have periodically time-varying fields, such as in a synchrotron, 
where the field does not reverse polarity and nor does it follow a sinusoidal waveform. The 
magnetic field waveform required can take many forms and to first order the overall shape 
is not as critical to the magnet designer as the peak rate of change of the field and the 
repetition rate. 

When dealing with time-varying fields, we have to take account of two extra effects, eddy 
currents and hysteresis. Both of these lead to additional power losses in the magnet on top 
of the resistive (ohmic) losses which DC magnet coils also suffer from. Eddy currents can 
also generate unwanted magnetic fields that can perturb the beam. The power supply for 
an AC magnet also has significant extra challenges which can limit how rapidly the fields 
can be changed. 


Eddy Currents 


Eddy currents are loops of electrical current induced in a conductor by a changing magnetic 
field. The name comes from the analogy to water forming eddies or whirlpools in areas of 
turbulence. The induced current is due to Faraday’s Law which states that the voltage, V, 
induced in a loop of conductor in a region of varying magnetic field is given directly by the 
rate of change of the magnetic flux, ®, as 


V=-—. (4.47) 


Eddy voltages are induced equally in all conductors which experience the same time-varying 
fields, irrespective of the material, and so the magnitude of the eddy current depends in- 
versely on the resistivity of the conductor. This means that non-magnetic materials, such 
as copper and aluminium, and low relative permeability materials such as stainless steel 
that is often used for vacuum chambers, will all experience eddy currents to an extent that 
depends upon how conductive the material is. Since currents are flowing, there is an asso- 
ciated heating which can be very significant. The currents flow in the plane perpendicular 
to the magnetic field direction. 

To calculate the power deposited in a conducting material due to eddy currents we start 
from the simple conceptual layout of Fig 4.14. The magnetic field, B, is perpendicular to 
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FIGURE 4.14 Conceptual sketch for calcul 
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The maximum, or peak voltage, induced in the loop is therefore 


Vp = 2x Lw Bo. 


The resistance of the loop, R, depends 


upon the resistivity of the material, p, the cross- 


sectional area of the strip, hAx, and the length of the strip, L; 


and so the peak current in the loop will 
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Now we can calculate the peak power, Pp = IR, as 
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By integrating the peak power in the loop with respect to x, from 0 to a, we can determine 


the peak power in the full block; 
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Since the average value of sin? is 1/2, the average power loss per unit length in the block is 
a? Aw? Be 
6p 
where A is the cross-sectional area of the block, 2ah. So, a copper conductor, with resistivity 
of 1.7 x 1078 Qm, of cross section 10 mm x 10 mm, in a 1 T peak field oscillating at 50 Hz, 
will be absorbing a power loss due to eddy currents of 2.4 kW/m. This power loss scaling 
also explains why AC magnet yokes must be laminated in the xy plane that is parallel to 
the magnetic field, with an insulating coating between laminations. The thickness of the 
laminations, 2a, is selected to be small enough to ensure the power loss is manageable yet 

not so small that the number of laminations per yoke becomes unwieldy. 


Pioss = ; (4.48) 


Hysteresis 


Hysteresis describes the dependence of the magnetic properties of the steel yoke on its past 
history. Fig 4.15 shows the typical relationship between B and H for a nominal magnet 
steel. When the material is first exposed to a magnetizing force, B increases with increasing 
H along path a. At sufficiently large values of H, the increase in B levels off and we say 
that the material is saturated. Now, if the magnetizing force is reversed, B follows path b. 
When the magnetizing force reaches zero, there is still a magnetic field in the material, 
and in the air gap of any magnet built using this material. If we continue to reverse the 
magnetizing force, then the material will again reach saturation at the opposite polarity. 
If we then increase H back towards zero, then the material follows path c and at zero the 
material is again magnetised but with opposite polarity to previously. The field in the air 
gap at zero excitation will be different depending upon which path the material has been 
taken through. This is why we must degauss magnets to ensure that the magnetic fields 
are repeatable for the same current in the coils. To be precise, degaussing implies that 
the remanent field in a magnet is reduced to zero but, in general in an accelerator, this 
is less important than repeatability from day to day, which can be achieved by following 
the same magnet excitation cycle and not necessarily requiring the reversing of the magnet 
polarity or the field at zero excitation being exactly zero. If the remanent field in the steel is 
required to be zero, then a comprehensive degauss process should be followed whereby the 
material is taken around the hysteresis path repeatedly whilst the magnet excitation levels 
are progressively decreased towards zero. Eventually the loops shrink in area until they are 
negligibly close to the origin and the remanent field is then zero. 

The hysteresis loop described by taking the material into saturation at both extremes 
defines the boundary of other possible loops that the material would follow if the material 
is not excited so strongly. The power loss in the steel due to hysteresis is proportional 
to the area enclosed by the loop for the particular excitation regime that it is subject 
to. The greater the extremes of the excitation, the greater the losses. The losses are also 
proportional to the volume of the steel yoke and the frequency with which it is excited. The 
losses also depend upon the material choice; not all steel alloys are the same. Steels with 
high silicon content (a few %) have significantly lower AC losses than the usual low-carbon 
steels that are utilised in DC magnets (such as XC06/AISI 1006 and XC10/AISI 1010). 
Steel manufacturers will provide measurements of AC losses for different grades of steel, 
in W/kg, usually at the transformer frequencies of 50 and 60 Hz. Note that these losses 
combine both hysteresis and eddy losses. 


Pulsed Kicker Magnets 


There are some accelerators which require very fast pulsed dipole magnets or kickers, such 
as for injection or extraction of beams. In these cases the magnets are off for most of the 
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FIGURE 4.15 Typical variation of magnetic field as a function of magnetizing force for steel. 


time and only fire as and when needed. Strictly speaking, they are in a different class than 
the AC magnets considered above since they do not necessarily follow a periodic waveform, 
but as they are certainly time varying, they have many issues in common with AC magnets 
and so are covered in this section. Kickers are characterised by very short pulse durations, us 
or even ns, and consequently can only have relatively low field strength (tens of mT). There 
are two common classes of injection and extraction kickers. The first one is purely inductive, 
with one or a few turns of conductor and with the power supply as close as possible to the 
magnet to minimise stray inductance. The second, which is capable of faster rise times, 
is a transmission line or delay line system which matches the impedance of the kicker 
(capacitance and inductance) to the line, and the rise-time depends on the propagation 
time of the pulse through the magnet. Matching the impedance is not trivial and can lead 
to complex and expensive kicker and line designs. Purely inductive kicker magnets follow 
the same design principles as for other dipole magnets, except that steel laminations are no 
longer practical and ferrites must be used instead. Ferrites are ceramics that include a large 
proportion of iron oxide and are ferrimagnetic so they are used to enhance magnetic fields 
in a similar manner to steel although they do have much lower saturation levels (hundreds 
of mT). Ferrites have very high resistivity, meaning eddy currents can be neglected and the 
hysteresis losses are small even at very high frequency. Kickers are often mounted outside of 
the vacuum to avoid increasing the impedance encountered by the beam and so in this case 
conductive vacuum chambers cannot be used as the eddy currents would be too severe and 
impact on the field level in the beam region, and so ceramic vacuum vessels are employed 
which have a thin conductive coating on the inside to provide a conducting path for the 
beam image current. 
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Pulsed Septum Magnets 


Septum magnets are typically combined with kickers to form complete injection or extrac- 
tion systems. Kickers can differentiate beams in time by pulsing so fast that only the beam 
that should encounter the field does encounter the field. The very fast pulses required from 
a kicker limit the magnetic fields that they can generate. Septum magnets differentiate be- 
tween beams spatially; they generate a strong dipole field to steer the injected or extracted 
beam but have zero field a very short distance away (~5 to 10 mm) so the stored beam is 
unaffected. There are DC and pulsed versions (the pulses are typically tens of us); this sec- 
tion will briefly discuss two pulsed types. The first, with cross section illustrated in Fig 4.16 
(a), has a simple C-shaped yoke, fabricated using thin steel laminations, and a single turn 
coil; the return leg of the coil separates the high-field region from the (very close to) zero 
field region. To achieve the required field levels (~1 T, say) the current in the single turn 
must be very large and so cooling of the coil is required but challenging, especially in the 
return leg which is desired to be as thin as possible so the physical separation between the 
two beams is minimised. The alternative design, with cross section illustrated in Fig 4.16 
(b), has a simpler coil arrangement wound around the back leg of the yoke, making cooling 
more straightforward, and then a passive conducting screen separates the two magnetic field 
regions. When the coil is pulsed, eddy currents are induced within this screen which then 
act to shield the field. The eddy current screen is very effective with field levels less than 
1% of the main field being achieved [6]. If lower leakage fields are required, then the eddy 
current screen arrangement can be extended to create a full return box around the magnet 
and a thin steel magnetic screen also added on the outside of the eddy current screen in the 
critical region, in which case field levels less than 0.1% of the main field are achieved [7]. 
The eddy currents themselves do not disappear as soon as the current pulse has reached 
zero current; they decay on a timescale set by the resistivity of the material within which 
they are flowing. 


4.5 Permanent Magnets 


Electromagnets are the standard solution employed for the vast majority of particle ac- 
celerators with permanent magnets (PMs) being used in some significant but still niche 
applications, such as for undulators in accelerator-driven light sources. There is currently 
an increasing interest in the application of PMs to more mainstream solutions for common 
magnets such as dipoles and quadrupoles. Clearly, PM-based solutions are only useful in 
generating static magnetic fields (equivalent to DC), not time-varying ones, although it 
should be recognised that the field is not necessarily fixed, and many designs exist which 
enable adjustable dipole and quadrupole fields using PMs. One reason for this increasing 
interest is that PMs do not consume any electricity in coils or associated cooling water in- 
frastructure and so the electrical power demand, and hence the operating costs, of a facility 
can be significantly reduced. A second reason is that PMs are very powerful and they can 
generate very strong magnetic fields, competitive with normal conducting electromagnets, 
and when physical space for the magnet is tight they can often exceed the capabilities of 
electromagnets. Other advantages are that since no cooling water is required, then a po- 
tential cause for magnet vibrations can be eliminated, no high-precision power supply is 
required, and they are extremely stable and reliable from day to day as there is very little 
that can fail. The Fermilab Recycler ring is a 3.3 km antiproton storage ring which was 
and remains the first large-scale accelerator project built where all of the main magnets are 
based upon PMs and not electromagnets [8]. The ring has operated for many years now 
and the experience has been very positive [9]. 
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FIGURE 4.16 Cross section of two types of pulsed septum: (a) is a direct drive version and (b) is an 
eddy current version. 


Permanent Magnet Materials 


A material is said to be a permanent magnet or magnetically hard if it can independently 
support a useful flux in an air gap of a device. PMs are ferromagnetic and as such they 
have a characteristic hysteresis loop (or BH curve); the loop for an ideal PM is shown 
in Fig 4.17. As the magnetizing force increases, the magnetic field increases with gradient 
uo. Remember that for magnetically soft iron alloys, the gradient is uruo where ur is the 
relative permeability of the material, which is non-linear with H and much larger than 
1, in general, until the steel is completely saturated. As the magnetizing force decreases 
to zero, the PM material remains fully magnetised and exhibits a strong remanent field, 
B,. The material remains magnetised and resists any negative H until large values are 
reached and then the PM effectively flips polarity. The value of H which is required to 
reduce B to zero is called the coercivity, He. The value of H where the flip occurs is 
called the intrinsic coercivity, H;, and this is a very useful number for comparing different 
grades of material since it effectively describes just how permanent the material is, which is 
naturally an important requirement! It should be noted that the ideal PM is linear in the 
second quadrant; this is important since this is the zone where the PM will be operating 
to deliver flux (+B) into an air gap (-H). All PMs are affected by temperature changes, 
with their values for B, and H, decreasing as the temperature increases. Since accelerator 
magnets operate at around room temperature (ignoring superconducting cryogenic magnets 
for now) there is little danger of the environmental temperature causing irreversible changes 
to the PM. A more important consideration is the temperature drifts which the PMs might 
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FIGURE 4.17 The variation of magnetic field with magnetizing force for an ideal PM. 


encounter within the accelerator facility since these can cause small magnetic field changes 
which could be important. Undulator magnets may be mounted in temperature-stabilised 
enclosures to counter this effect, although it is more common now for the full accelerator to 
be temperature stabilised and not just the undulators. A counter to the loss in field strength 
as the temperature increases is that if the temperature decreases the field will increase. This 
intrinsic effect is made use of now by undulators which are cryogenically cooled (often to 
77 K) in order to benefit from this increased magnetic field performance from the PMs. 
There are two common types of PM that are used for accelerator magnet and undulator 
applications, with a third type now starting to be used for cryogenic applications. All three 
types exhibit behaviours close to the ideal described above. The two standard types are 
samarium-cobalt (SmCo; or Sm2Co,7) and neodymium-iron-boron (Nd2Fe,,B). The third, 
new type is praseodymium-iron-boron (Pr2Fe;4B), which is favoured now when wanting 
to take advantage of enhanced magnetic properties by working at cryogenic temperatures. 
The other two types do benefit from increasingly enhanced magnetic properties as they are 
cooled but only down to an intermediate temperature of around 150 K, below which the 
magnetic properties degrade due to a spin reorientation effect [10]. Pr2Fe,4B does not suffer 
from this effect and so it can be operated at temperatures that are able to be maintained 
easily, such as the boiling point of nitrogen (77 K). Table 4.1 summarises the key features of 
the two main types of PM employed. The characteristics of praseodymium-based magnets 
are not so well established yet although the remanent field and the coercivity have been 
measured to be ~ 1.3 T and ~1500 kA/m respectively at room temperature and ~1.6 T 
and ~6000 kA/m at 77 K [11, 12]. Note that the relative permeabilities of these materials 
is very close to unity and so they behave magnetically in a similar manner to a coil in 
air, with a good approximation being that the fields from a group of PM blocks can be 
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TABLE 4.1 Comparison of the typical characteristics of the two main PM 
materials commonly used for accelerator magnets. 


Characteristic SmCo NdFeB 
Remanent Field at 300 K (T) 0.9-1.1 1.1 — 1.4 
Coercivity, He (kA/m) 600-800 800- 1000 
Intrinsic Coercivity, H; (kA/m) 1000 — 2000 1000 — 2500 
Maximum Energy Density (kJ/m?) 150 — 250 200 — 400 
Temperature Coefficient of B, (%/°C) —0.035 —0.10 
Maximum Operating Temperature (°C) 250 — 350 50 — 200 
Relative Permeability, ju, ~1.03 ~1.1 


added linearly to calculate the total field within a volume. In detail, the permeability is in 
fact anisotropic, it is marginally different in one plane of the material to another, and this 
might need to be taken into account when carrying out detailed magnetic calculations for 
a particular system. 

The parameter ranges associated with the magnetic characteristics listed within Table 
4.1 give an indication of the broad spread available by selecting different grades of the same 
basic alloy, using alternative additives, that the manufacturers are able to create to match 
the needs of a particular application. The manufacturers can in fact predict and control 
the parameters very precisely to meet the needs of the customer. The PM manufacturers 
publish catalogues which are available online, listing the various PM grades available and 
their precise characteristics. 

As well as the temperature variation of the material characteristics, which can potentially 
lead to unwanted magnetic field variations over time, there are two other issues which need 
to be considered. The first is the concept of aging and whether or not the characteristics 
of a material are likely to degrade over months or years and the second is that of radiation 
damage and whether or not a PM can and should be employed in an accelerator environment. 
The aging of PM blocks is difficult to quantify since it is affected by the grade, coercivity, 
the shape of the block, the magnetic circuit, the regular cycling or variation of the fields 
within the circuit, the temperature variation, and so on. However, in reality, since the 
accelerator environment is maintained at room temperature, and large coercivity materials 
are employed, aging over time is not a serious concern. When we procure PM blocks we 
request that they are all heated to well above room temperature, perhaps 50°C or so, for 
a few hours to ensure that any irreversible aging due to moderate temperature variation 
has already been built into the blocks. This temperature aging has a minor impact on the 
material characteristics and subsequently we have not observed any measurable variation 
of PM performance over a timescale of more than twenty years. 

Radiation damage to PM materials is a significant issue that must be considered when 
working within an accelerator environment. There are certainly several examples of PM- 
based accelerator magnets that have been directly affected by ionizing radiation or direct 
impact from high-energy particle beams. Most often these are undulators which have PM 
blocks very close to the beam axis (often only a few mm). The effect of the radiation is 
to degrade the magnetic properties of the PM material, locally leading to loss of magnetic 
field and poorer field quality. This is especially important for undulators which have very 
strict field quality requirements to optimise the synchrotron radiation output that they 
emit. However, the vast majority of undulators installed in accelerators have suffered little 
or no damage. If beam losses in the vicinity of an undulator are carefully controlled then 
undulators can operate extremely well for tens of years with apparently no loss of perfor- 
mance. We have measured the magnetic field in an undulator that was removed from a 
2 GeV electron storage ring after more than twenty years of continuous service and could 
measure no discernible degradation in the magnetic field levels at all. 
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There have been a wide variety of experiments exposing PMs to radiation in a number 
of different scenarios. An excellent literature review summarising the data has been written 
recently [13]. The results of the experiments show certain trends, but it is not possible to es- 
tablish clear quantifiable predictions based on the results. The general trends that have been 
established are that temperature is very important, experiments above room temperature 
show that damage is worse and cooling magnets to well below room temperature confers in- 
creased resistance to damage. The radiation resistance of SmCo is consistently shown to be 
better than NdFeB. Both of these effects are likely to be due to the higher relative intrinsic 
coercivity. It has also been demonstrated that the block aspect ratio (length to diameter 
ratio) makes a significant difference, as does the direction of the magnetization within the 
block (the easy axis) in relation to the direction of the beam generating the radiation. The 
broad conclusions supported by the studies show that radiation resistance is improved by 
using a grade with a higher coercivity, choosing SmCo over NdFeB (Sm2Co 17 being more 
resistant than SmCos), altering the shape of the magnet or the geometry to select a more 
optimal working point for the material, decreasing the temperature of the PM, pre-baking 
the PMs to thermally stabilise them, and positioning the PMs as far from the beam axis as 
feasible to reduce the dose and to allow the possible addition of radiation shielding. 


PM-Based Dipoles 


First we will consider a simple dipole to gain an appreciation for some of the basics of 
PM-based systems, using a similar approach to [14]. Returning to our derivation for the 
magnetic fields in a DC electromagnet, we found that the integral of B along a closed path, 
P, that bounds a surface S is given by the current flowing through that surface: 


B 
d= $ Hedl= f J-ds=Nz. (4.49) 
P Pro P S 
For our PM case, there are no external currents and so the integral equals zero: 
f H-dl=0. (4.50) 
P 


If we consider the geometry for the dipole of Fig 4.18, integrating around the path shown, 
then this becomes 
H-dl+ H-dl+ H-dl+ H-dl=0. 
P1 P2 P3 P4 


As for the electromagnet case, we will assume that the steel has huge relative permeability 
(H~ 0) and so we can neglect the integrals along paths 2 and 4; 


H -dl = — H.-dl, 
P1 P3 

which gives us 

HmLm = -Hg Lg, (4.51) 
where Lm and Lg are the lengths of the PM block and air gap respectively. Remembering 
that, in air, we have that H, = B4/Ho, so that 
Horb 

Ly 


By =— ko (4.52) 
We will also assume that any flux leakage from the steel away from the air gap is negligible 
and so the total flux flowing across the air gap will be the same as that flowing through the 
PM; 


7 


BmAm = BgAg; (4.53) 
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where Am and A, are the cross-sectional areas of the PM block and steel pole respectively. 
Substituting our result for B, into this gives 


HmLmAg 


An (4.54) 


Bm = —Ho 

The relationship between Bm and Hm also, separately, depends upon the material prop- 

erties (recall Fig 4.17) such that Bm = oHm + Br for the ideal PM material (a good 

approximation for the materials discussed earlier) which is perfectly linear, with gradient 

lo, in the second quadrant. This effectively gives us two simultaneous equations which we 
can use to solve for Bm, 

B,Lm Ag 


m= Lms 
LgAm(1 + Tea) 


(4.55) 


So, in our simplified dipole scenario, the magnetic field within the PM is determined by 
the physical size and shape of the PM and the air gap, as well as the remanent field of the 
PM itself. In the simplest case where the size and shape of the PM and air gap are equal 
(Lm = Lg and Am = Ag), then Bm = By = B,/2 and Hm = —H, = —B, /2po. To increase 
the field in the air gap we could double the length of the PM (Lm = 2L) and then we 
would get Bm = By = 2B, /3. If we want to increase the field further, then we could reduce 
the steel pole area to concentrate the flux, for example with Lm = 2L, and Am = 2A,, then 
the field in the air gap equals B,., whilst in the PM it equals B,/2. We should remember 
that these scenarios are somewhat idealistic, but they do demonstrate how the air gap fields 
relate to the material properties, as well as the size and shape of the PM block and air gap. 
By manipulating these parameters, we are changing the working point of the material, this 
is often shown graphically which can be instructive. The two simultaneous equations which 
we solved to find B,, are plotted in Fig 4.19. One line represents the intrinsic material 
properties and the other, called the load line, represents the physical layout of the system 
under consideration. The intersection of the two lines is called the working point. The safest 
region to operate in, in terms of avoiding unwanted demagnetization effects, is when the 
working point is nearer to the vertical axis (small negative H) since at large negative H the 
material is closer to, or possibly in, the non-linear region of the BH curve and any changes, 
perhaps caused by physically moving parts of the system or by temperature changes, can 
then be irreversible. The most efficient working point to choose is when the BH product in 
the second quadrant is maximised and the maximum magnetic energy is being utilised. For 
the ideal material this occurs at B,/2. 

The easiest PM-based dipole to design and build is one which has a fixed field, with 
no requirement for any magnetic field adjustability at all, except via manual intervention 
such as by changing the physical position of the PM or steel pole, perhaps. The actual 
design of a PM dipole depends, as for other magnet types, on the exact specification and 
constraints. There is considerable flexibility in the options available. When PM dipoles 
have been implemented in accelerators, some effort has been put into coping with (the 
relatively small) temperature effects. The issue being that due to the PM material properties 
(see Table 4.1), the dipole field will naturally reduce if the temperature increases. A good 
solution to this issue is to build a passive compensation scheme into the magnet itself. This 
can be achieved by using a second material within the magnet which also has a temperature 
dependence but in the opposite direction, so the two effects can be made to compensate for 
each other. A successfully demonstrated solution uses a Ni-Fe alloy which has permeability 
that varies significantly in the room temperature region [15]. The magnet design includes 
volumes of this material adjacent to the PM so that some of the magnetic flux is shunted 
or short-circuited away from the air gap. If the temperature increases, the PM naturally 
delivers less flux, but simultaneously, the Ni-Fe alloy permeability decreases and so less flux 
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FIGURE 4.18 A simple dipole magnet design driven by a PM block. 
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FIGURE 4.19 Graph showing the load line and working point of the PM. 
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is short-circuited and if the volume of alloy is chosen correctly, then the two effects will 
compensate each other over the working temperature range of the magnet. Improvements 
in the dipole field change, with temperature of around two orders of magnitude have been 
demonstrated for an NdFeB-based dipole, compared to the original level of —0.11%/°C, 
over a temperature range of more than 10°C [16]. It should be noted that inclusion of this 
passive temperature compensation scheme will lower the maximum magnetic field that is 
achievable since flux is being short-circuited. 

The development and upgrade of storage ring light sources towards diffraction-limited 
sources, in particular, has recently led to an increased interest in the widespread applica- 
tion of fixed-field PM-based dipoles to take advantage of their compactness compared with 
electromagnetic equivalents. A second consideration is that dipoles with an optimised lon- 
gitudinal field variation can offer superior electron or photon beam properties to the facility 
and that generation of this variation lends itself more naturally to a PM design [17, 18]. One 
study has shown that a 2 m long PM-based dipole with optimal longitudinal field profile 
was three times lighter than the equivalent electromagnetic version [19]. 

If small magnetic field adjustability is required for a particular application (a few percent, 
say) then it would be reasonable to include electromagnetic coils in the system to provide this 
range of variation. It should be noted that since the PM material has a relative permeability 
~ 1, then it effectively acts like an additional air gap within the system. This means that the 
coils are less effective than we would normally expect compared to a standard electromagnet 
which only has an air gap in the region of interest. This can mean that quite powerful coils 
are required for relatively small adjustments. 

If large magnetic field variation is needed from the dipole magnet (more than ten per- 
cent, say) then the only really sensible solution is to physically move parts of the system. 
Solutions exist which move steel components [17] and others that move the PM [20]. These 
two example designs are presented schematically in Fig 4.20, although many other design so- 
lutions are possible. Dipole field variations of a factor ten have been demonstrated although 
even more variation would be possible for both concepts. The forces that are present in 
strong magnet systems can be very significant and so the motion system needs to be able 
to cope with these whilst simultaneously not affecting the precise location of the poles to 
ensure sufficient field quality is maintained as the field is adjusted. 

Assembly of PM-based magnets raises new challenges compared to electromagnets. The 
most significant extra challenge is, of course, handling of the PM material which cannot be 
‘turned off’. Great care has to be taken at all stages of assembly to ensure that the attractive 
forces between the PM and the steel yoke and other magnetic items, such as fasteners and 
parts of any motion system, are considered. Assembly by hand is generally impossible as 
the forces are far too high to cope with, and so special fixtures must be designed and built 
so that the items can be brought together to build the magnet in a safe and controlled 
manner. Non-magnetic tools, typically made from a Cu-Be alloy, must be used at all times. 


PM-Based Quadrupoles 


Quadrupoles with fixed gradient strength have been realised in a number of different formats. 
The most common format used is the Halbach type, which simply consists of a ring of PM 
blocks [21]. In fact, this format can be used to create any multipole type, simply by adjusting 
the remanent field direction of the blocks to suit the type desired. The number of segments 
per ring, which is independent of the multipole order itself, is for the magnet designer to 
choose, typically being a compromise between complexity and achievable gradient and field 
quality. There are also hybrid versions which include steel in various configurations [22]. 
Examples of a PM-only version and a hybrid version are shown in Fig 4.21. 

PM quadrupoles with variable gradient have been developed by several groups [22] with 
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FIGURE 4.20 Two example concepts for PM-based dipoles with large field adjustability. Option (a) 
has two steel plates which move symmetrically as a pair. As the plates approach the PM, shown in grey, 
they short circuit the flux and so reduce the field at the beam. Option (b) slides the PM in and out of the 
steel yoke region to alter the flux path and reluctance through the steel and so the field at the beam. 


FIGURE 4.21 Two example concepts for fixed gradient PM quadrupoles. Option (a) is a classic Halbach 
design with 12 PM segments with magnetization direction shown by the arrows. Option (b) is a hybrid 
variant with steel poles shown in grey. 
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many different designs being produced. Each design has been optimised to achieve different 
characteristics and so there is not one design which meets all needs. Some designs can 
change the gradient by more than an order of magnitude, others are optimised for maximum 
gradient, others for large good field region, and so on. As for the dipoles, if very modest 
gradient adjustability is needed then coils can be used. If significant adjustment of the 
gradient is needed, then some parts of the quadrupole must be physically moved to alter 
the gradient. Typically, PM blocks are moved linearly or rotated (to alter the magnetization 
direction) to adjust field levels. For linear motion systems, where the blocks are moved away 
from the beam axis to lower the gradient, the minimum gradient is determined by how far 
the blocks are moved and in principle the gradient could be zero if they are moved far 
enough away. Other examples use an outer steel shell to short circuit the PM blocks to 
lower the gradient more rapidly for the same physical motion. Three different examples for 
adjustable quadrupoles are sketched in Fig 4.22. 


PM-Based Undulators 


The use of PM-based dipoles and quadrupoles is certainly not yet mainstream, although it is 
becoming more popular as stronger fields from more compact magnets are required, as more 
complex field shapes are required, and also to reduce the electrical power consumption of 
accelerator facilities. However, there is one area of accelerator magnets where PMs definitely 
are the mainstream and that is in the application of undulators for generating light from 
relativistic electron beams. Storage ring light sources are one of the most common advanced 
accelerator applications globally, and free-electron lasers are also now developing at a pace. 
Both of these types of light source rely on undulator magnets to generate the light in 
these world-leading X-ray sources for researchers. The vast majority of undulators have 
been, and continue to be, based upon PMs. An undulator is essentially a device which 
generates a periodic magnetic field, most commonly the field is in the vertical plane and 
it varies sinusoidally in the longitudinal direction, such that as an electron travels through 
the magnet it oscillates horizontally from side to side about the beam axis, emitting light 
in the forward direction. Undulators are built to enhance the light through constructive 
interference, much like a periodic diffraction grating, and more details on the properties of 
the light that is generated are given in Chapter 6. The wavelength of light which is observed 
in the forward direction depends strongly on the electron energy, but also on the period of 
the magnetic field and peak strength of the field. In essence, high magnetic fields and short 
magnet periods are optimal as these create the shortest wavelengths possible for a given 
electron energy. Magnetic fields of the order of 1 T at periods of only 20 to 30 mm are typical. 
These levels are impossible for electromagnets (unless they are superconducting!) since the 
physical space available for the coils with sufficient Ampere-turns is just not available. It is 
at these small dimensions that PMs really excel. 

The simplest undulator magnet uses blocks of PMs laid out in two arrays, one above 
the electron beam, and one below the electron beam (see Fig 4.23 (a)). The magnetization 
direction of the blocks rotates by 90° each time and the vertical field generated is a very 
good approximation to a pure sine wave. Another very common design is shown in Fig 4.23 
(b) where steel poles are employed in a hybrid configuration. For the design which only 
employs PM blocks (so-called ‘pure PM’ undulator) it is possible to derive the magnetic 
field at the electron beam analytically. Assuming that the PM block heights are equal to 
half the period length, the peak on-axis field, B,,, is given by 


By, = 1.72B,e- 79/4, (4.56) 


The inclusion of steel poles, with non-linear permeability behaviour, in the design means 
that an analytical solution is no longer possible and so magnet design codes are needed to 
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FIGURE 4.22 Three example concepts for variable gradient PM quadrupoles, all shown at their maxi- 
mumn field gradient position. Option (a) is a concentric pair of Halbach rings with the outer ring rotating to 
adjust the quadrupole gradient [23]. Option (b) has a pair of PM blocks which are driven apart symmetri- 
cally about the beam axis and the outer steel shell short-circuits the field to lower the quadrupole gradient 
more rapidly [24]. Option (c) is a Halbach type with extra PM cylinders which are rotated in unison to 
alter the quadrupole gradient [25]. 
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FIGURE 4.23 Two common magnet designs for undulator magnets which generate sinusoidal vertical 
field variation in the longitudinal direction. Type (a) uses PM blocks only with magnetization direction 


varying as shown by the arrows. Type (b) is a hybrid variant with steel poles shown in grey. 


model the design to accurately predict the fields that are achieved. An empirical equation, 
equivalent to the pure PM one, for the peak field in an undulator using PM material (NdFeB) 
with remanent field of 1.25 T, is given by [26] 


2 
By. = 3.60 exp ( = 4.45" + 007%). (4.57) 
This equation is said to be valid over 0.3 < g/Xy < 3.0. Many similar empirical equations 
have been generated for different remanent fields, for undulators which generate fields in 
both transverse planes (elliptical undulators), and also for cryogenic devices. An excellent 
summary of these various equations is provided in [26]. An example comparison of the peak 
magnetic fields achievable in a hybrid and pure PM undulator, as a function of period, is 
shown in Fig 4.24. 


4.6 Superconducting Magnets 


The application of superconducting (SC) materials to accelerator magnets opens up new 
possibilities that would otherwise not be available to us. In particular, the generation of 
multi-Tesla dipole fields is essential for high-energy physics-focused accelerators to reach the 
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FIGURE 4.24 A comparison of the fields achievable in a hybrid and a pure PM undulator assuming a 


remanent field of 1.25 T and a magnet gap of 8 mm. 


highest possible proton energies in a (relatively!) small circumference facility. SC materials 
have zero electrical resistance in DC operation and so do not suffer from resistive heating. 
As a consequence they can carry extremely high current densities that enable dipoles of 
order 10 T to be fabricated. The main disadvantage of SC magnets is that the materials are 
only SC at very low temperatures, with accelerator magnets typically operating at 1.9 or 
4.2 K. This increases the complexity and the cost significantly and so SC magnets are only 
used when there is no clear alternative. The understanding, application, and engineering of 
SC materials and magnets is a specialist topic that many people have spent whole careers 
on. This section can only be a brief introduction to the topic, highlighting key features and 
differences to conventional magnets. There are some excellent textbooks on the subject (e.g. 
(27, 28]) as well as the proceedings of specialist accelerator schools (e.g. [29]) that the reader 
is invited to study for more details. 


4.6.1 Superconducting Materials 


The tried and tested SC material of choice is niobium-titanium (NbTi) because it is by far 
the easiest SC material to work with. It is ductile, easy to insulate, can be readily formed 
into wires of suitable dimension, is relatively inexpensive, and generally quite forgiving. It 
can be wound into coils as easily as we wind copper wire. Like all ‘Type II’ SCs it will 
remain SC so long as it is operated below its characteristic critical surface of temperature, 
magnetic field at the conductor, and current density. At a fixed operating temperature (e.g. 
4.2 K) the surface simply reduces to a line of current density against a magnetic field which 
defines the SC boundary of the material. At 4.2 K and with a field at the conductor of 
6 T, the maximum current density for NbTi is around 2000 A/mm? [30]. If, instead, the 
material is cooled further to 1.9 K, then the same current density can be attained at up to 
9 T. Remember that this is the magnetic field that the conductor is experiencing, not the 
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field in the air gap of the magnet. In general, the dipole field at the beam is larger than 
that at the SC wire itself. The critical surface defines the limiting boundary at which the 
SC becomes normal conducting (resistive) and so it makes good sense for magnet designers 
to give themselves some safety margin in order to be able to build reliable magnets that 
will operate at their defined specification for year after year. The more aggressive magnet 
designs, which try to push the state of the art, might choose to work at around 90% of the 
limit (the LHC dipoles run at 86% for their nominal field value of 8.3 T [31]) whereas less 
demanding SC magnets routinely built by industry (e.g. for Magnetic Resonance Imaging 
systems) are more likely to operate at around 50% of the limit. 

If higher-strength magnets are required, that cannot be fabricated by only using NbTi, 
then another option is to employ Nb3S5n which, due to a more favourable critical surface, 
is able to operate at high current densities at much higher magnetic fields. Whereas NbTi 
can sustain around 2000 A/mm? at 6 T at a temperature of 4.2 K, Nb3Sn can sustain 
2700 A/mm? at 12 T or 1450 A/mm? at 15 T [30], with even more impressive performance 
possible at 1.9 K. Unfortunately, Nb3Sn is much more difficult to work with and so is only 
used when absolutely necessary. Nb3Sn is a brittle intermetallic compound that is created 
by raising the constituents to high temperature (typically 650 to 700°C) for many hours 
and then brought back down to room temperature in a controlled manner (the exact recipe 
varies grade by grade and is provided by the SC supplier). The primary problem with this 
particular SC material is that due to its nature, the SC itself is rather brittle and so winding 
coils with the material causes major degradation to the SC properties. The way around this 
fragility is to wind the coils before creating the SC material itself. This is called the wind 
and react approach. The unreacted wire is ductile and can be wound in a similar manner 
to NbTi. However, once the coil is wound, it must then be heat treated for it to become 
SC and hence useful. The required reaction process adds additional complications to the 
engineering (coping with the large thermal expansion and subsequently handling of the coil 
in the brittle state) and precludes the use of common electrical insulating coatings which 
are unable to withstand the high temperatures. In short, the use of Nb3Sn adds extra risk 
to the magnet fabrication process but it does offer a route to significantly higher magnetic 
fields. 

In addition to these two materials there are several high-temperature superconducting 
(HTS) materials which are being actively applied to accelerators in some niche areas, such 
as for current leads in the transition range between room temperature and the magnet coils 
at 4.2 K or below, or being prototyped into coils for evaluation. Examples of these HTS 
materials are MgBo, Bi-2212, and REBCO (rare earth barium copper oxide). Further details 
on the relevant properties of these materials are available in [32]. 

One point to note is that the current density mentioned earlier is the value within the SC 
itself. However, in practical situations the SC material is not the only material present, and 
since the wire is then typically formed into a multi-wire cable, which necessarily includes 
physical gaps between the wires, and then formed into a coil, with further gaps, the average 
current density carried by the space occupied by the coil cross section can be much less than 
the actual peak value within the SC. To account for this filling factor in their calculations, 
magnet designers quote the engineering current density, which is simply the average current 
density flowing through the coil cross section. 

A second point to note is that the makeup of an SC wire is actually quite complex. It 
is formed of a large number of narrow filaments of continuous SC strand held in a copper 
matrix which supports all the filaments. The number of filaments in a single wire can range 
from tens to thousands and they can be only a few um thick in some cases. The copper 
is very important as it not only supports the SC, it also conducts the current and the 
heat when the SC is no longer in the SC state. This is a very dangerous and unwanted 
state because if high currents are flowing the wire will heat up rapidly and melt, so it is 
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essential that there is enough copper within the wire to transport the heat and the current 
temporarily whilst the high current is removed or diverted as quickly as possible. The wire is 
typically formed into a multi-wire Rutherford cable which has a transposition (an optimised 
wire ‘twist’) built in and a good packing factor. There are often twenty to thirty wires that 
form a cable. The advantage of working with a cable is that large currents can then be 
carried (thousands of amperes) and the winding becomes easier to handle (far fewer turns 
per coil). 


4.6.2 Coil-Dominated Magnets 


We use SC materials to generate dipole fields well in excess of 2 or 3 T. For example, 
the LHC main dipoles have a nominal operating field of 8.3 T (using NbTi) and the High 
Luminosity upgrade to the LHC is planning to use 11 T dipoles (using Nb3Sn). With such 
high fields, well in excess of the magnetic saturation fields of magnetic steels, it no longer 
makes sense to use steel poles to shape or enhance the fields. Instead, SC magnets are 
primarily coil based and rely upon enormous currents flowing to generate the required field 
levels. This is quite a different approach to that discussed earlier in Section 4.4 and leads 
to a radically different magnetic design. 

Following the approach of [28] it is easy to show that pure multipole fields (i.e. dipole, 
quadrupole, sextupole, etc.) can be generated in the xy plane by an arrangement of currents 
flowing parallel to the s (beam) direction. In the idealised case, the current flows in the s 
direction on the edge of a circle (which is in the xy plane) and so maps out a cylinder. The 
required current distribution for a pure multipole as a function of the azimuthal angle, 0, 
is given by 

I(0) = Io cos mð. (4.58) 


A pure dipole is generated inside the cylinder when m = 1, a quadrupole when m = 2, 
and so on. This type of magnet is therefore referred to as a cos-theta magnet and two 
example ideal cases are sketched in Fig 4.25. Of course, fabrication of such a design is not 
really practical and so approximations to the ideal case are made using so-called sector 
coils. In these designs the current density, J, within the wire or cable is constant and the 
coil geometry is set to approximate the cos 0 requirement. A simple sector coil for a dipole 
magnet is shown in Fig 4.26. The inner radius of the coil is r, the coil width is w, and the 
coil half angle is a. For this geometry the dipole field can be calculated analytically to be 
[30] 


_ 2S pow 


B sin a, (4.59) 


T 


where uo is the permeability of free space. We can see from this equation that the magnetic 
field at the beam scales with current density and coil width, but does not depend upon the 
radius. It can be shown [28] that this simple sector coil generates not just a dipole field but 
also higher-order multipoles (sextupole, decapole, and so on). Furthermore, if the angle, a, is 
selected to be 60°, then the sextupole term actually cancels to zero. However, the remaining 
multipole terms are generally considered to not be acceptable (the decapole is still a few 
percent of the main field, for example) and so this simple sector coil arrangement is not a 
good enough approximation to the cos @ ideal in practice. To overcome this, the solution is 
to include more degrees of freedom in the design by, for example, adding additional layers or 
by breaking the coil into more parts, or a combination of the two. This concept is illustrated 
in Fig 4.27 and has been used successfully by the very-high-field SC magnets employed in 
accelerators like the LHC. The field quality achieved in practice by this type of magnet is 
just as high as it is with the iron-dominated, lower-field, magnets discussed earlier. 


Magnets for Beam Control and Manipulation 153 
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1(0) = Ip cos O 


(b) 


1(0) = Ip cos 20 


FIGURE 4.25 Examples of ideal cos-theta magnets for generating pure multipole fields. The current is 
flowing into and out of the paper with the distribution as noted in the equation next to each type. (a) is a 
dipole, which has peak current at the mid-plane and zero current top and bottom, and (b) is a quadrupole, 
which has peak current in four locations and zero current in four locations. 


FIGURE 4.26 A simple sector coil approximation to an ideal cos-theta dipole. The current density is 
uniform within the coils, shown in grey. The current direction is into the page on the right (cross in a circle) 
and out of the page on the left (dot in a circle). 
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FIGURE 4.27 An illustration of how additional degrees of freedom can be added to the sector coil 
concept by adding extra layers and splitting layers into parts. The dotted lines highlight just three of the 
angles which the magnet designer can optimise to ensure the magnet field quality is sufficient. 


Inclusion of Steel 


Whilst steel is not used to shape the field using pole pieces in these very-high-field magnets, 
it still has a useful role to play in confining the field to the magnet and so preventing stray 
fields which can otherwise be quite disruptive to an accelerator facility. To achieve this 
magnetic shielding, a steel yoke surrounds the coils in the form, at least conceptually, of a 
thick hollow cylinder. This steel cylinder must be of sufficient thickness that the steel is not 
saturated. A simple calculation for the 8.3 T LHC dipoles shows that the steel must be at 
least 100 mm thick [30]. 

The steel cylinder also has the additional benefit of acting as a virtual coil because 
of image currents within the yoke. These image currents, which make up the virtual coil, 
increase the magnetic field within the region of interest. Since the virtual coil has a much 
larger cross section than the actual coils, the image current density is reduced and the 
impact on the field is similarly reduced. Nevertheless, the steel yoke surrounding the LHC 
main dipole coils increases the field by 17%, and this increase seems to be relatively typical 
for such magnets. The effect of the steel yoke on the field quality should also be taken into 
account since inner and outer shells will be inverted in the virtual case. 

There are some circumstances when it makes sense to also include steel poles to shape the 
fields in an SC magnet. This type of design is called superferric. For the steel to determine 
the field shape it must not be fully saturated, and so this implies lower magnetic fields 
than considered above, perhaps in applications such as correction dipoles or higher-order 
multipoles. 
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FIGURE 4.28 Sketch showing the direction the forces are acting on a pair of dipole sector coils. 


Practical Considerations 


We already noted in Section 4.4 that the forces on a coil depend upon the current density 
and magnetic field at the coil. Since both of these parameters are very large in an SC magnet 
it is no surprise that the forces acting on the coils can be enormous. If part of a coil moves 
as an SC magnet is powered then the energy released can be sufficient to cause a quench 
(become resistive) since the heat capacity of materials close to absolute zero is very low and 
so even very small releases of energy can be enough to raise the temperature locally above 
the critical surface. Displacement of a coil or part of a coil will also impact the field quality. 
Typically, the tolerance on the placement of wires in such a magnet is to within 0.1 mm 
[30] and so even small movements can be quite detrimental. 

Handling of the forces exerted on the coils in high-field SC magnets is therefore a major 
challenge. In some cases the forces are so high that they are beyond the material yield 
strength and so plastic deformation is a real concern that must be addressed. First, we will 
consider the direction of the forces and then the countermeasures that are used to make high- 
field SC magnets viable. If we first apply the right-hand rule to a simple solenoid magnet 
we will see that the magnetic force is pushing the coil outwards radially away from the axis, 
creating a hoop stress in the coil. Now, if we consider the cos-theta dipole arrangement, we 
find that there is a radial force in the midplane pushing the coil away from the axis and that 
the parts of the coil away from the midplane are pushed towards it, as shown schematically 
in Fig 4.28. In the beam direction, the forces are acting to stretch the coil longitudinally, so 
overall the forces are trying to expand the coil, much like the solenoid case. Of course, the 
magnitude and direction of the force within any particular part of the coil depends upon 
the magnetic field strength and direction and so this rather simple picture presented here is 
actually much more complex in the real world. A more detailed analysis is presented in [33]. 
For an example 5 T dipole, the horizontal force is estimated to be 1 MN per longitudinal 
metre [28]. 

The solution employed to enable such forces to be handled by the sensitive SC windings is 
to apply a pre-stress to the coils to counter the magnetic forces. A radial inward compression 
force is generally applied by mounting the coils inside a pair of stainless steel or aluminium 
‘collars’ which are mechanically pressed around the coils and then secured with dowel rods 
to maintain this pre-stress on the coils. It should be remembered that this assembly activity 
takes place at room temperature but that the magnet is operated cold and so the different 
thermal contractions of the selected materials will also alter the pre-stress levels and must 
be taken into account. 
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Another practical consideration is making sure the magnet can cope with the rapid 
transition between SC and normal conducting that occurs when the magnet quenches. A 
quench will take place when the SC material passes through the critical surface of field, 
temperature, and current density, and is thought to generally be caused by a local release 
of energy due to the friction associated with very small movements of a wire or cable. A 
quench is a risk to the magnet integrity since a portion of the coil has become resistive 
and so can heat up rapidly considering the large current densities employed. Local failure 
of a winding is a very real possibility. For these reasons quench protection is taken very 
seriously and when a quench is detected, an electronic circuit will turn off the power supply 
as quickly as possible and the stored energy will be diverted into a secondary circuit which 
can handle the energy safely. Quench protection systems can be quite complex in detail but 
are essential for magnet protection since if a coil fails it is effectively scrap. 

A further practical consideration is the phenomena known as training. This describes 
the process by which an SC magnet very often improves in performance after successive 
quench events. It is common for an SC magnet to not reach the design magnetic field when 
it is first powered, and instead it will reach some intermediate level and then quench. At 
the second time of powering up, a good magnet will then quench at a higher field and so on. 
It is as if the magnet is learning (or training) how to cope with higher and higher currents 
and fields. The explanation for this behaviour is that small motions in the windings are 
taking place to cause the quench and that the coil is then locally in a more stable position. 
Good-quality magnets will retain a memory of being trained and so when they are warmed 
up to room temperature and then cooled back down, they will not need to be trained again. 
The number of quenches needed to attain design specification is hard to predict but ten or 
twenty would not be unusual. 


4.6.3 SC Undulators 


Short period SC undulators can generate higher magnetic fields than the permanent mag- 
net (PM) undulators discussed in Section 4.5, but PM undulators remain the mainstream 
solution with only a handful of SC examples being used routinely in accelerator-based light 
sources [34, 35, 36]. The reason that SC undulators are still not the first choice option is in 
large part due to the extremely successful track record of PM undulators and their ongo- 
ing improvement rather than any particular deficiency with SC undulators. Regardless of 
the progress being made with PM devices, there is still a significant benefit to be gained 
from using SC materials instead, and it is for this reason that several groups are actively 
developing short-period, high-field SC undulators [37]. The handful of examples that have 
been installed into light sources perform extremely well in terms of reliability and stability 
and there is no reason to doubt that SC undulators will grow in popularity in the future. 
Indeed, there seems to be a growing view that free-electron laser-based light sources might 
see the first major installation of these devices in large numbers [38]. As well as increased 
magnetic field, SC undulators are believed to be several orders of magnitude more resistant 
to radiation damage than PMs, which is especially important for high bunch repetition rate 
free-electron lasers. 

The magnetic design of SC undulators is very straightforward in concept, with most 
teams adopting very similar approaches, as illustrated in Fig 4.29. Two physically indepen- 
dent arrays of SC windings are fabricated on steel yokes and arranged in such a way that 
current flows transversely to the electron beam in an alternating arrangement such as to 
create the required periodic field. The two arrays are held apart by a non-magnetic fixture 
and are connected in series. Compared to the SC dipoles and quadrupoles discussed earlier, 
the forces and quench protection arrangements are much easier to cope with. However, the 
mechanical tolerances on the wire placement, yoke dimensions, and array separation, of 
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FIGURE 4.29 Sketch showing the side view of a section of a typical SC undulator design for generating 
vertical magnetic fields. 


better than a few tens of um, are difficult to achieve over the one- or two-metre length of 
the devices. 

As for the hybrid PM undulators which include steel poles, it is not possible to calculate 
analytically the magnetic field generated by an SC undulator because of the non-linear 
behaviour of the steel. Instead, scaling laws have been generated which have then been 
cross-checked against 3D magnetostatic simulations [26, 39, 40] to provide an estimate for 
the peak field in an NbTi undulator; 


By, = (0.3282 + 0.0678A,, — 1.053.1073A2 + 5.85.1076A3 Je 77u05), 4.60 
yo u u 


A comparison of the peak magnetic field achievable in an SC undulator fabricated with 
NbTi, that is operating at 80% of critical current density, compared against PM-based un- 
dulators is given in Fig 4.30. The figure clearly demonstrates the very significant advantage 
that the SC undulator has over the other options. In this example, the magnet gap is set for 
all devices at 8 mm. The aperture available for the electron beam is less than the magnet 
gap in a standard undulator, of either type, due to the need for a beam vacuum chamber 
within the magnet gap. In the PM case it is now common to put the magnets inside the 
vacuum system to remove the need for this beam vacuum chamber and so increase the 
magnetic field experienced by the beam as the magnet gap can be reduced for the same 
beam aperture. It should also be possible to have the SC undulator as part of the beam 
vacuum system, and so gain a similar benefit in the future, and at least one group is actively 
pursuing this option [41]. Similarly, the fields could be enhanced in the future by switching 
to Nb3Sn or one of the HTS materials, this is an active field that is developing quickly. 


158 


The Science and Technology of Particle Accelerators 


© 


ro 
e 


Ja 
© 


PM Hybrid 


Vertical Field (T) 
> bb 


> 
Un 


= 
© 


Period (mm) 


FIGURE 4.30 A comparison of the fields achievable in an SC magnet using NbTi against the PM 


alternative assuming a remanent field of 1.25 T and a magnet gap of 8 mm. 


Exercises 


What magnetic field is required to bend a beam of protons with a kinetic energy of 

800 MeV onto an arc of radius 7 m? 

If instead of protons we wanted to bend a beam of electrons, of the same kinetic energy 

on the same arc, what magnetic field would then be required? 

The LHC has a magnetic dipole field of 8.33 T for protons of kinetic energy of 7 TeV. 

What is the bend radius of these protons within the dipole magnet? 

If we instead stored electrons at 7 TeV kinetic energy in the LHC (ignoring any syn- 

chrotron radiation effects) what magnetic field would we have to set to ensure that they 

travel on the same bend radius? 

We want to design a normal conducting electromagnetic dipole with a magnetic field 

of 0.8 T and a gap between the poles of 40 mm. 

(a) If we assume that the steel has infinite permeability, how many Ampere-turns are 
required in each coil of the dipole? 

(b) If we set the number of turns in each coil to be 20, what will be the current flowing 
through the conductor? 

(c) We choose the conductor cross section to be 10 mm x 10 mm. What will be the 
current density flowing through the conductor if it is solid, with no integral water 
cooling channel? 

(d) The current density is sufficiently large that a direct water cooling channel is re- 
quired. We decide to limit the current density to 10 A/mm?. Calculate what cross 
section is now available for the water cooling channel and, assuming it is a circular 
channel, what the diameter of this channel will be. 


Magnets for Beam Control and Manipulation 159 


(e) Now, assuming that our dipole has a pole width of 100 mm and is 1 m long, estimate 
the energy stored in the magnet and then the inductance. 


(f) Finally, estimate the magnetic force between the two pole surfaces. 


6. We decide to also consider a permanent magnet dipole with the same peak field of 0.8 T 


and gap between the poles of 40 mm. Again the pole will be 100 mm wide and 1 m 

long. 

(a) Show that when the permanent magnet is used at peak efficiency, and the field 
within the material is B,/2, that the magnetizing force, H, within the material is 
H,/2. 

(b) Our selected permanent magnet material has B, = 1.2 T. Assuming the material is 
ideal, with relative permeability of one, what cross section, Am (refer to Fig 4.18), 
should the permanent magnet block have for it to operate at maximum efficiency? 

(c) If we choose the permanent magnet block to also be 1 m long, like the steel yoke, 
calculate the required height of the block, Lm. 

(d) If you are very keen, repeat this calculation of the permanent magnet block volume 
for a few alternative magnetic field levels within the block to satisfy yourself that 
maximum efficiency does correspond with minimum required volume of material. 
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Now that we understand how to accelerate particles in an accelerator and how to produce 
the magnetic fields that steer, focus and manipulate the bunches, we can turn our attention 
to the dynamics of the transverse motion. We shall learn what these cavities and magnets 
do to a charged particle beam and how we can design magnet layouts to achieve the goals of 
our machine. We’ll look at the basic equations governing the motion of charged particles in 
an EM field and the consequences for accelerator builders. In this chapter we shall focus on 
single particle motion, so the interaction between particles (collective effects) is considered 
in Chapter 7. 

We shall start by some general consideration of the motion, observing that the rate 
of oscillations in the transverse plane is larger compared to the longitudinal plane in a 
strong focusing accelerator. * After a discussion of various ways of ‘doing’ dynamics we 
shall consider Hill ’s equation and explore the consequences, leading to linear single particle 
dynamics and the Courant-Snyder formalism. Following this we turn our attention to the 
real-life situation, when things are not quite ideal. This starts with magnetic imperfections, 
focusing on the linear case, and moves to a beam of charged particles with a spread in 
momentum. This gives us dispersion, chromaticity and momentum compaction. Finally we 
consider the motion of beams of non-interacting particles and an introduction to non-linear 
dynamics. 


*This is not necessarily true in weaker focusing or lower-energy accelerators. 
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5.1 Preliminary Considerations 


Let’s start with a thought experiment! If you have ever visited a particle accelerator, cast 
your mind back to that visit. Or close your eyes and imagine you are flying towards, and 
then inside a particle accelerator. Ideally this would be a particle accelerator ring. Perhaps 
you have a favourite one! Once you are there, take a good look. What observations do you 
make and what would impress you? After you have thought about this, we will tell you our 
observations. 

So, what did you come up with? We thought the following features were worthy of note: 


e The particles spend a long time in the ring. In many rings this can be many hours. 
The fact that the particles do not move to very large amplitude and touch the machine 
aperture means their motion is stable. In the LHC the protons travel around the ring 
over 11,000 times per second and stay in there for many hours, whilst in Diamond the 
revolution frequency is 533.8 kHz. 


e The machine repeats itself, i.e. it is periodic. We can see this from the layout of the 
magnets. 


e The particle motion does not have the same periodicity as the machine on a particle- 
by-particle basis but the envelope of the motion follows the machine periodicity. 


You may have gotten the first one, though the second two are less obvious. These are 
observations we will study and explain using beam dynamics. 

There is a huge range of particle accelerators, from the very small to the very large Large 
Hadron Collider. These can often be classified by a small number of high-level parameters, 
and doing so is useful to compare machines and get a feel for scale and purpose. The first 
way to classify machines is in terms of the type of particle accelerated and its geometry. For 
example, the CLARA accelerator at Daresbury is a linear (single pass) electron accelerator 
and the Large Hadron Collider is a circular (many pass) proton accelerator. Following this, 
the beam energy, in terms of MeV, GeV or TeV for example, gives the energy scale of the 
accelerator and the current (or bunch charge) gives the scale of the number of particles 
accelerated or stored. The design beam energy at the LHC is 7 TeV and each as-designed 
beam stores 0.5 amperes of proton current. At Diamond the electrons go around the 562 m 
ring 534,000 times a second. The maximum beam current is 300 mA. Following this there 
are a myriad of accelerator parameters used to discuss, compare and classify the acceler- 
ators. For example, the colliding beam luminosity in the case of a collider and the beam 
lifetime in the case of a storage ring. The calculation and evolution of these parameters 
is something we can compute using single and multi-particle dynamics. We perform beam 
dynamics calculations and modelling to understand the motion of particles in linear and 
circular accelerators, to understand the fundamentals of existing machines, optimise and 
commission accelerators, design new machines, e.g. a new collider, and design novel ma- 
chines, e.g. a non-scaling FFA. So the science of beam dynamics is central to making and 
operating particle accelerators. How do we do this? The fundamental tool of a person en- 
gaged in beam dynamics is knowing how to calculate the motion of a charged particle in 
a real electromagnetic field, which includes motion in magneto-static configurations, what 
happens in a time-dependent field, computing charged particle optics, understanding the 
approximations used, how the particles interact with the surroundings, and whether the 
particles in a bunch interact with themselves. For now, we shall concern ourselves with 
single, non-interacting particles, starting initially with static magnetic fields and bringing 
in time-dependent electric fields later in the chapter. We shall then worry about what it 
means to have many interacting particles in Chapter 7. 


Single Particle Motion 165 


—_— re 


Ten perature 


M omma 


FIGURE 5.1 The global view of a gas in terms of macroscopic variables such as pressure (P) and 
temperature (T) (left) and the local view of a collection of gas in terms of gas molecules (right). 


The most basic question we can ask is: how do I represent the beam passing through 
the accelerator in my beam dynamics language? This leads us to a hierarchy of beam 
descriptions and in the course of our analysis of beam dynamics we will use different, but 
related, descriptions of the beam. A useful way to think of this is in terms of a microscopic 
or a macroscopic description, with the latter only using a few, global parameters. A useful 
analogy is a box of gas of some substance, as shown in Figure 5.1. We can think of this 
system as being described by several numbers: the pressure (P), the temperature (T), the 
volume (V), the number of moles (n), etc. An equation of state relates these quantities 
together, and, in the case of an ideal gas, we have the ideal gas law 


PV =nRT, (5.1) 


which relates the state variables to each other, R being the ideal gas constant, and tells us 
how they change. This gives a description of the gas in terms of a few variables. 

Our gas is also made up of a collection of gas molecules, each with a position and a 
momentum in every degree of freedom of the system. Each molecule has a speed v and a 
kinetic energy (translational energy). This is a microscopic view of our gas in a box, and 
an equally valid way of thinking about the box of gas. The two pictures are related in a 


fundamental way 


3 
U = 5kT, (5.2) 


with U (the average kinetic energy) directly proportional to the macroscopic temperature 
of the gas T, k is the Boltzmann constant. Hence the microscopic (particle) view and the 
macroscopic view are related, as they should be, as we’re talking about the same box of 
gas. Both views can be useful to understand the system. It is common in physical systems 
to have several different, but equivalent, views of the same situation, e.g. physics of an ideal 
gas, quantum mechanics, with wave and matrix formulations. We have this situation in 
beam dynamics. 

The first view is the global view where we assume a ring or beam line exists as an object 
and study the global properties of the system. For example, the stability of the beam or the 
number of oscillations per turn (tune, which we shall discuss later in this chapter). 
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FIGURE 5.2 The standard problem of a harmonic oscillator in one dimension, where the restoring force 
is proportional to the distance from the equilibrium position. 


Then we have a local view, where we worry about the details of the machine and think 
about individual particles. We need to think about which frame of reference is best and 
what the fields look like in this frame. We then can ask how a single particle moves in 
this system. As an aside, in this book we shall use the words machine, ring and lattice. 
By machine we mean any complete accelerator system, for example the LHC or Diamond. 
By ring we mean a closed arrangement of dipoles forming a repeating path for the particle 
beam, and by lattice we mean any general arrangement of magnets appearing inside an 
accelerator. 

So we have different ways of looking at a beam. There are also several ways of doing 
particle dynamics. These ways are equivalent to each other and all can be used to solve 
dynamical problems. The three formulations of dynamics are 1) Newtonian dynamics 2) 
Lagrangian dynamics, and 3) Hamiltonian dynamics. The one you should choose depends 
on the kind of problem you are solving. In accelerator physics we tend to use Newtonian 
and Hamiltonian dynamics, and each one has its own merits. 


5.2 The Dynamics of a Simple Harmonic Oscillator 


In this section we shall see that the transverse motion of a particle in an accelerator is very 
similar to the motion of a simple harmonic oscillator and we can learn a lot from thinking 
about this similarity. 

Let’s work in one dimension, denoting the position from some equilibrium point by 
x(t) and the velocity by «(t), as shown in Figure 5.2. If the restoring force is given by 
F(t) = —ka(t), called Hooke’s law, where k is the spring constant, then the coordinate x(t) 
obeys 

#(t) + wea(t) = 0, (5.3) 


where we write wĝ = k/m, with m denoting the mass of the oscillating particle. This is 
just using Newton’s second law with Hooke’s law. Note that in reality Hooke’s law is only 
approximately true for a real spring and experimental measurements show that most springs 
have higher-order non-linear terms in the restoring force. Our intuition, and experience of 
masses on a spring, tell us the solution should be oscillating (or diverging), which we can see 
by substituting a trial solution x(t) = exp(At) where A is some constant, into the equation 


of motion. This gives \1,2 = +iwọ and a general solution of 
x(t) = Acos(wot) + Bsin(wot) 
= Csin(wot + ¢). (5.4) 


The amplitude C and phase ¢ depend on the initial conditions, unlike the natural angular 
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frequency of oscillation wọ, which depends on the spring and the mass that is oscillating. 
Note if we replace the sign of the force in Hooke’s law we replace the sine and cosine in 
the solution by sinh and cosh, and obtain hyperbolic diverging solutions (if you are not 
familiar with sinh and cosh, spend some time reading about them, as we shall use these 
functions again). These ideas will come back for our description of beam dynamics, and 
the observation that we can obtain both oscillating solutions or diverging solutions by a 
restoring force proportional to distance from an equilibrium point and by flipping the sign 
in the force. This makes physical sense as the force now pushes the mass to large values of 
x for x > 0 and the force pushes the mass to smaller values of x for x < 0. 

It is a standard plan of attack when solving differential equations to rewrite a second- 
order differential equation as two first-order differential equations and, if we did so, we could 
write the solution in terms of two constants, or invariants, of the system. The first one is 
linked to the total energy, thus determining the size of the motion (and can be linked to 
amplitude), and the second one appears as a phase in the harmonic function describing the 
motion. These constants are both linked to initial conditions, and where we release the mass 
(or for an accelerator later in this chapter, our particle). 

We shall see the appearance of such invariants in our study of transverse motion, and 
they shall prove to be very important in our study of beam dynamics. The language of 
invariants is particularly powerful in physics and engineering and gives a useful framework 
to understand and predict motion. The invariant of the motion using Newton’s formulation 
of dynamics emerged after a bit of work. This structure is very clear if we tackle dynamics 
problems using an alternate formulation first proposed by Hamilton, which we discussed 
when we thought about formulations of dynamics in the previous sections. As we saw, we 
shall take the approach of Newton and use forces in this book to make the physical results 
clear, but let’s take a short diversion and consider our harmonic oscillator from the point of 
view of Hamilton. In this framework the central object is the Hamiltonian, which contains 
the physics of the system and is formulated in terms of coordinates and their corresponding 
canonical momenta. Together these form something called a conjugate pair, and we have 
one pair for each dimension of the system. For our one-dimensional oscillator we have the 
position x and the canonical transverse momenta py. Once we have the Hamiltonian (which 
we will do in a moment) we use Hamilton’s equations to figure out the motion, 


de _ OH 
dt pr’ 
dpr OH 
ae ae (55 


Notice that we have two first-order differential equations to solve, instead of a single second- 
order differential equation in Newton’s approach. 
The Hamiltonian for the oscillator is given by 


1 
H=} “mwer? , (5.6) 


where m is still the mass of the particle. This is really just the sum of the kinetic and 
potential energies and, in the presence of only forces that are constant in time we can prove 
this sum of terms is conserved. This is easily done by taking the total derivative of H and 
using Hamilton’s equations, and we shall leave this as an exercise for the keen reader. It 
means that 

dH 


GP =: (5.7) 


168 The Science and Technology of Particle Accelerators 


and H is an invariant. Now, directly applying Hamilton’s equations gives 


ae. Be 

dt =m’ 

dps 

= —muen, (5.8) 


which is what we obtained using Newton’s approach with forces. We will not use Hamil- 
tonians for the study of transverse motion but there are many good references [1] and the 
approach is very useful for studying non-linear motion. 


5.3 Hill’s Equation 


We have just seen that we can obtain either converging (oscillating) or diverging solutions by 
using Hooke’s law of a restoring force for an oscillator, which states that the restoring force 
is proportional to the distance from the equilibrium position. This is exactly the behaviour 
we shall see in our focusing quadrupole elements and we will show this can be used to obtain 
stable behaviour in both transverse planes. Quadrupoles have four magnetic poles and are 
the building blocks of our focusing lattices. The linearly rising magnetic field will give rise 
to the focusing of our beam. 

This feature will emerge from our fundamental equation of transverse motion, called 
Hill’s equation. The equation for the horizontal motion, with coordinate x, is 


1 
ale) + (tat) +5) =0, (5.9) 
and the equation of the vertical motion, with coordinate y, is 
y” (s) + ky(s) = 0. (5.10) 


In these equations s is the longitudinal distance along our accelerator beamline, kz,y(s) 
denotes the momentum-normalised focusing strength and p is the bending radius, as defined 
in the figure. We shall define these quantities more carefully later. Note we’ve written out 
the explicit dependence of the functions on s, which can be dropped for brevity once we 
know what is going on. 

It’s worth spending some time to understand the features of these equations, as they will 
govern our study of transverse dynamics, at least linearly. What does this last statement 
mean? Before we answer that, let’s think about coordinate systems. We shall not present a 
complete derivation of Hill’s equations for accelerator physics — this can be found in all the 
standard textbooks and we learn nothing significantly useful if we present it in this book. 
For very good treatments see [2, 3]. In this chapter we work with transverse coordinates, 
so in the horizontal plane we use x, which is the horizontal position and we use y for the 
vertical position. The question then is, what are x and y relative to? To understand this we 
need to look at the coordinate system. 

The coordinate system we use is shown in Figure 5.3 and forms the basis for the analysis 
in this chapter. We are developing the equations of motion in a linear or circular machine, 
and our equations work in both situations. For the circular case, the curvature is provided 
by a set of dipoles, which define a curved trajectory through the tunnel. The local curvature 
is denoted by p and the distance along this curve in the laboratory frame is denoted by s. 
Our coordinate system is often called a co-moving system and will move along the reference 
trajectory defined by the dipoles at the same speed as some reference particle. We then 
define all quantities, for example transverse positions x and y, with respect to this reference 
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FIGURE 5.3 Our co-moving coordinate system for describing the location of our particle in the accel- 
erator. The bending arises from the dipoles forming the geometry of the machine. 


particle. The curved reference trajectory is normally called the orbit, and the coordinate 
system moves with a reference particle around the design orbit defined by the dipoles. The 
co-moving system has the consequence that we won’t see the curve in the dipoles explicitly, 
only the focusing around the reference particle due to the bending. For the linear machine 
case, there is no focusing terms from the bending. 

So our coordinates represent deviations with respect to the design (ideal) orbit, and we 
assume these deviations will be small (x is normally around millimetres). The assumption 
that quantities like x are small will make our equations linear; we will discuss lifting this 
constraint later. For coordinates relative to this design orbit we use the position and slope 
dx/ds = x’, and 6 will denote deviations from the reference particle momenta. A transverse 
position vector in this frame then is 


R=ex+ yy, (5.11) 
where x and y are unit vectors in the co-moving frame and 
r=p+a. (5.12) 


Let’s sketch out the derivation. Once we’ve defined our co-moving coordinate system and 
understand what it means to differentiate position vectors, we are able to write down the 
left-hand side of Newton’s second law. We’ve one side of Newton’s second law and we need 
the other side. What is the force? This is the right-hand side, which we shall equate to the 
left-hand side when we figure it out. 

In the presence of electric and magnetic fields we use the Lorentz equation (or Lorentz 
force law), already seen in Chapter 2, 


F=q(E+vxB), (5.13) 


where v is the velocity of the charged particle with charge q. This physical law tells us, in 
vector notation, the force on a charged particle moving with velocity v from an electric field 
E and magnetic field B. For our purposes we shall assume the velocity in the longitudinal 
direction is far bigger than the transverse velocity, so the transverse velocities are small, 
as are quantities like x’. Equating this Lorentz force to m - xX gives a set of equations of 
the horizontal and vertical motion. There is also an equation for the relative longitudinal 
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motion but we’ll disregard this and approach motion in this plane in another way. It turns 
out this longitudinal motion is far slower than the transverse motion, which we shall see 
in Section 5.9 and means we treat these two kinds of motion differently. We can also make 
another assumption, that is, we are interested for now in dipole and quadrupole fields, and 
so the field which appears in the Lorentz force law can be written as 


B = Boyy + g (xy + yx). (5.14) 


In this equation, Boy is the dipole field which generates the curved coordinate system, and 
the second term is the quadrupole field we impose on the beam. The curl-free nature of the 
free-space Maxwell equations means g = 0B, /0y = OB, /Oz. 

We can now linearise these equations, meaning dropping any terms in any of the variables 
of order two and greater in any of the variables, i.e. we cross out any term looking like 2?, 
xa’, x°, x and so on. This is an approximation and means our equations describe linear 
motion and are valid for small values of the variables (so that x? < x etc). We also expand 
the momentum deviation 6, which appeared in the denominator due to Newton’s second 
law, and we write 

2 
ig PO) (5.15) 

We drop terms of 6? and higher, and so assume the quantity 5 is small. For now we shall 
also drop terms containing 6, and restore them when we talk about dispersion. Later, when 
we think about something called chromaticity, we shall restore terms like æ- 6. But for now 
our Hill’s equation to describe linear, on-momentum motion is 


2"(s) + (rc) + Tr) =i (5.16) 


in the horizontal plane and 


y” (s) + ky(s) =0 (5.17) 
in the vertical plane. We have defined some notation, and defined 
g 1 
kz = — + 5, a 
Bp + F (5.18) 
and the equivalent for the vertical plane, 
g 
ky =—-=. (5.19) 


Note the vertical plane only has a contribution from the quadrupoles through g, and there 
is a minus sign difference between the planes for the g terms. 

So we have our equations of motion. Let’s think about their features. To do this we write 
both equations very compactly as one equation by defining some notation. Let u = x,y for 
the variables, and wrap up the second term in each equation into a single function. Hence 
Hill ’s equations become 

u” + Ku=0, (5.20) 


where K = g/(Bp) +1/p? in the horizontal plane and K = g/(Bp) in the vertical plane. 
We can see now these equations look exactly the same as our harmonically oscillating 
mass on a spring. Imagine for a moment there was no 1/p? term in the equation for K in 
the horizontal plane. This would mean the quadrupole gradient sign sets the value of the 
restoring force, so that a positive g would focus in the horizontal plane and defocus in the 
negative plane. Therefore negative g would do just the opposite. We also see the implication 
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FIGURE 5.4 The natural focusing of particles in a constant magnetic field for clockwise moving particle. 
The left particle has a small transverse offset compared to the reference particle. The right particle has a 
small momentum offset compared to the reference particle. 


of Maxwell’s equations — a quadrupole designed to focus in one plane must defocus, or create 
a diverging trajectory, in the other plane. We can control the spring constant but with some 
constraints! We shall explore the role of focusing in the coming pages, but perhaps you 
can start to imagine what a series of magnets would look like if we need to somehow have 
confined motion in both planes. 

Let’s think about that 1/p? term. It’s acting like a focusing magnet but only in the plane 
of the bending, in this case the horizontal plane. This is called natural, or body, focusing and 
arises from the effect of the dipole magnets defining the reference trajectory. The natural 
focusing arising through bending can be visualised using the analysis of Figure 5.4. This 
figure shows the reference orbit as the darker line, and a particle moving with respect to this 
orbit as the lighter line. The left-hand figure shows what happens when the general particle 
has a small transverse position offset. If you follow it around the ring it oscillates around 
the reference orbit, with one complete oscillation per turn. The right-hand figure shows the 
general particle having a small momentum offset, and showing stable behaviour. Note there 
is no natural focusing in the vertical plane as we don’t bend in this plane. For a straight 
beamline this term is absent. As an aside, we should also mention that the length through 
which a magnet acts on the beam is often longer than its physical length of material due to 
field lines curving at each magnet end. This is called the effective length of a magnet. 

So Hill’s equations describe how the transverse coordinates x and y evolve as a function 
of distance through the magnetic lattice, and look like linear harmonic oscillator equations. 
We can use this fact to solve them easily, which means writing down explicit functions 
x(s) and y(s). Of course we can solve Hill’s equations in many ways, including numerically, 
but let’s pursue the approach most commonly taken in the literature. Let’s consider the 
horizontal motion and take the case K > 0. We know the solution is built of harmonic 
functions and contains two unknown constants, so let’s guess at 


a(s) = c,cos(V Ks) + c2sin(V K8), (5.21) 
where cı and cz are the constants fixed from the initial conditions. We can take the derivative 
a'(s) = —c,VK sin(V Ks) + VK cos( V K8), (5.22) 


and substitute into Hill’s equation to quickly check this is indeed a solution. To find the 
constants we note that x(0) = zp and 2’(0) = 29, giving 


cq) = To, 
Xo 


CQ = 5 (5.23) 


S 
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FIGURE 5.5 The differing quadrupole kicks obtained for different particle transverse offsets. 
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FIGURE 5.6 A quadrupole magnet, showing the four poles with coils wound around them to drive the 
magnetic field. ©) STFC 


and so ; 


ax(s) = xo cos(V Ks) + ® sin(V Ks). (5.24) 


This has a first derivative of 
x'(s) = —xoV K sin(VKs) + 24 cos(VKs), (5.25) 


so the kick given by the quadrupole magnet points back towards the origin (focusing) and 
gets bigger the further the particle is away from the origin. This focusing effect for off-axis 
particles is shown in Figure 5.5 and a real quadrupole can be seen in Figure 5.6. 

The equations evolving x and z’ can be written as a matrix equation, wrapping two 
equations into one and using linear algebra to express the linear nature of our system. 


Hence we can write 
=M, a (5 26) 
x! i quad a! ġ 
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for the evolution of the vector formed by x and x’ from position s = 0 to position s = s ina 
nicely compact way. This is just another way to write our two separate equations evolving 
x and x’ that we just discussed. The matrix Mguaa is given, for a focusing quadrupole, by 


vo ( cos(V Ks) Jz sin(V Ks) ) l 


—VK sin(V Ks) cos(V Ks) (5.27) 


In this linear formalism, the particle is represented by a point in (x, x’) space, known 
as trace-space. The matrix Mp acts on these trace-space vectors to evolve the particle in s. 
What happens if K < 0? Now we have the equation of motion 


x” — |Kļu = 0, (5.28) 


which has a diverging solution which can be written in terms of sinh and cosh functions. 
So we write 


ax(s) = cı cosh( V Ks) + cz sinh(V Ks), (5.29) 


where, as before, cı and cz are the constants fixed from the initial conditions. We can solve 
for the constants and write as a matrix equation as we did for the focusing case, giving 


vO ( cosh(./]K|s) Jr sinh VIR Is) 
+/|K|sinh(,/|K]s) — cosh(./|K|s) 


for the defocusing quadrupole. 

So a given quadrupole magnet focuses on one plane and defocuses in the other plane, by 
virtue of Maxwell’s equations. By convention we say a horizontally-focusing quadrupole is 
a ‘focusing’ quadrupole, conventionally known as an ‘F-quadrupole’. The focusing strength 
is related to the gradient of the magnetic flux density B by 


(5.30) 


2 AU og (5.31) 
Bp pdz 
A quadrupole with a positive sign for dBy/dx is horizontally-focusing, whereas a negative 
dB,/dz is horizontally-defocusing; the latter is conventionally known as a ‘D-quadrupole’. 
A drift space is a region of the beam line with no electromagnetic fields. We can figure 
out the evolution equations for x and x’ by either simple geometry or taking a limit of the 
quadrupole matrices for K — 0. Either way we find the variables change as 


(L) = x+2-L, 
GD). = fi (5.32) 


where the drift space has length L and (29,26) are the particle coordinates on entry to the 
drift space. This can be written as a matrix 


1 L 
Maritt = ( 01 ) , (5.33) 


telling us how a particle evolves in a drift, with a very clear geometrical interpretation. 
A useful approximation for the quadrupole matrix we already know (e.g. Mp) with finite 
length and called the thick lens matrices, is the limit when the focal length, f, of the 
quadrupole lens is long compared to its length, l. Hence we consider 


1 
f= Kl > I, (5.34) 
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which we find by letting l — 0 whilst keeping the product Kl constant; the product Kl is 
known as the integrated strength. This gives the quadrupole focusing matrix in this limit 
as 


1 0 
Minin = ( wi ) . (5.35) 
f 


This is the matrix for a horizontally-focusing quadrupole in the thin-lens approximation 
with a focal length of f. The kick towards the axis for a particle with a non-zero position 
with respect to the axis is clear. The matrix for the defocusing case is obtained by the 
transformation f — —f. The thin-lens approximation is useful for quick calculations of 
beamlines. 

We now have a linear matrix formalism for the evolution of a coordinate vector (x, x’), 
and know the matrix for a focusing quadrupole, Mguaa is given by 


f l g 
Mp = cos(V Ks) Jz sin(vV Ks) (5.36) 
—V/Ksin(/Ks) cos(vV Ks), 
a defocusing quadrupole, 
i cosh(/|K|s) Ta sinh(./|.K|s) an 
a= 
+4/|K|sinh(,/|K|s) cosh(,/|K|s), 
a drift, 
1 L 
Maritt = ( 01 ) f (5.38) 
and a thin lens quadrupole 
1 
ips ( T ) (5.39) 
F 


In a real accelerator we have lots of these elements arranged one after the other, as shown 
in Figures 5.7 and 5.8, with focusing quadrupoles, defocusing quadrupoles and intervening 
drift spaces. Take a close look at both these figures and try to spot the beamline elements 
we are discussing in this chapter. We shall look at lattices more closely later in this book. 
Each element is represented by a matrix, at least in our linear approximation. The 
question arises: how do we transform through these sequences of elements? The answer is 
intuitive and we shall not prove it. We multiply the matrices of each element to give an 
overall transfer matrix through the system, beginning with the start of the beamline on 
the right and pre-multiplying by the next element seen by the beam. So imagine we have 
a beamline consisting of a focusing quadrupole, followed by the drift space, followed by a 
defocusing quadrupole and followed by a drift space. This is the order of elements seen by 
the beam. The overall matrix for the transformation of the particle by the system is given 

by 
Meen = Marist © Mp - Marist © Mr. (5.40) 


Note the focusing quadrupole matrix sits on the right of the series of matrices in this 
expression; the overall transfer matrix is obtained by multiplying the individual transfer 
matrices in reverse order. We have also called the composite system a cell, for reasons 
which will become clear. The overall motion of a particle at the start of this system (defined 
as s = 0) to the end (defined as s = 1) is 


( a ) = Meen ` ( W E (5.41) 
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FIGURE 5.7 A beamline section for a so-called transfer line between two accelerators, showing an 
arrangement of focusing and defocusing quadrupoles (with four poles), dipoles (with two poles), and inter- 


vening drift spaces, mounted on a common supporting girder. © STFC 
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FIGURE 5.8 A beamline, showing an arrangement of focusing quadrupoles, defocusing quadrupoles and 


intervening drift spaces. © STFC 
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FIGURE 5.9 A sector dipole magnet, here designed to give a 60° deflection of 35 MeV electrons. Note 
the curve of the coils and poles, so that the electrons enter and exit perpendicular to the magnet end faces; 
this gives no edge-focusing effect. Rectangular magnets are also used, which do give edge focusing. © STFC 


We would encourage the reader to think now how they would turn this formalism into a 
simple particle evolution code (called a tracking code in the field). Under what circum- 
stances would your code give valid results? When you get some time, use your favourite 
programming language to write this code. 

The beamline pictured in Figure 5.7 also contains elements which bend the beam around 
the trajectory of the machine, and are the dipoles we used to define the curved reference 
trajectory. Do these have a matrix? The bending effect to define the reference trajectory is 
already included in our equation in the co-moving coordinate system but, as we discussed 
previously, this bending introduces some natural transverse focusing. This can be described 
by a matrix. To obtain it, we start from the matrix for a focusing quadrupole, Equation 5.27, 
in terms of K and note for a pure bending element 


1 
p 


K= (5.42) 

where again p is the bending radius. Hence we obtain the following matrix for a dipole of 
length | 

M _ cos? psiné 5.43 

dipole = = sind cos@ jJ’ (5.43) 


where 0 = l/p is the bend angle of the dipole. The geometric (natural) focusing is now clear. 
Note the matrix in the non-bending plane is a drift. An example of an accelerator dipole is 
shown in Fig 5.9. 

The consequence of Maxwell’s equations with no sources (specifically curlB = 0), dis- 
cussed in Chapter 2, means a horizontally focusing quadrupole is defocusing in the vertical 
plane, and vice versa. This follows from applying this Maxwell equation to the field of the 
quadrupole in free space and is due to the electromagnetic nature of the devices. However, 
all is not lost, and we can build systems of quadrupoles which overall focus in both planes by 
alternating polarity of quadrupoles. This alternating gradient, or strong focusing, principle 
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FIGURE 5.10 The path of a particle through a system of alternating quadrupole magnets. The first 
lens is focusing in this plane, and the second lens is defocusing (denoted by the shaded concave parts of the 
lens). The particle receives net focusing in both planes. 


was first proposed by Nicholas Christofilos in 1949, who patented rather than published 
the result. A group at Brookhaven National Laboratory — Ernest Courant, M. Stanley Liv- 
ingston and Hartland Snyder — independently discovered the same principle three years 
later when trying to solve an operational problem on the Cosmotron accelerator [4]. Today 
the strong focusing principle is central to the design of many particle accelerators. 
Consider the system of a thin-lens focusing quadrupole (focal length fı) separated by a 
drift (length d) from a defocusing quadrupole (focal length f2). If we compute the overall 
matrix of this system and look at the (2,1) element, this will give us the reciprocal of the 
system’s overall focal length, by comparison with the thin lens matrix for a quadrupole. If 


we do this we obtain 
1 1 1 d 


= | 
f fh h fife’ 
If we choose fı = — f2 = fs then the leading terms cancel and we obtain overall focusing 
in both planes at the same time, with focal length f = f2?/d. This is a very pleasing and 
surprising feature, and is the bedrock of many modern accelerators. We can understand this 
result by thinking of ray tracing and reference to the rays in Figure 5.10. Think it through 
yourself by visualising test rays at various transverse offsets. 
We have brought in the idea of a map in the form of a matrix, and this idea needs a 
bit of explanation. The matrix M is that map that brings an initial state vector X (so) to a 
final state vector X(s1), so that 


(5.44) 


X(s1) = M- X(s0). (5.45) 


For the linear case, the map can be represented as a matrix and the matrix representation 
is equivalent to the linear system. For non-linear systems, matrices do not work anymore 
and we need to find new representations of the maps, for example Taylor maps or Lie maps. 
For further discussion we refer the reader to Wolski’s textbook [1]. 

We know how to combine matrices (linear maps) with the rule 


M(s2|80) = M(sə|s1) 3 M(s1|s0), (5.46) 


again noting the order. Matrix algebra is not commutative, so we cannot switch the position 
of matrices in our expressions, or equivalently the order of beamline elements matters. But 
it is associative and we can form matrix sub-groups (provided we maintain the order of the 
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matrices!). One particularly useful map is the one-turn map. If we start at location s in a 
ring of circumference C’, then the one-turn map is defined as one turn around the ring 


M(s + CC). (5.47) 


This means the map for N revolutions of the ring is found from N applications to a given 
particle state vector of the one-turn map. We’ll come back to this idea when we discuss beam 
stability. The one-turn map in fact applies to any system, including a linear beamline, with 
periodicity. In this case it is called the one-period map and gives the transformation through 
one period. 

We’ve basically re-written our equations in terms of matrices. Is this useful? Yes, as it 
means we can use all the formal machinery of linear algebra, e.g. matrix multiplication, 
eigenvalues, eigenvectors, traces, similarity transforms, plus quickly see the impact on the 
beam of a series of elements. All this is very powerful and useful! Alas a lot of these ideas 
are beyond this introductory textbook. 

The computer realisation of these ideas gives birth to a simple beam-tracking code. 
These accelerator codes simply assume a piecewise-continuous representation of the accel- 
erator structure, with the order of elements the same as the real beam line being modelled. 
However, the number of matrices is not the same as the number of elements. This is because 
of edge focusing, which gives a kick to the beam at the entrance and exit faces of a rectan- 
gular dipole. We can write this kick as a matrix, and we’ll cover it later in the chapter. But 
be aware the numbers of matrices in a computer code is always more than the number of 
elements! 

Now we understand what an element does to our particle, we can track single particles 
through a composite system and, assuming the particles do not interact, many particles. 


5.4 The Courant-Snyder Formalism 


We’ve written down Hill’s equation for linear beam motion, determined that we can solve 
it, and written the solution using matrices and linear algebra. Hill’s equation is a second- 
order differential equation for a system with periodic focusing properties and we saw it is a 
little like an oscillating mass on a spring with a spring constant that changes with time. In 
fact, the variable spring constant k(s) for our accelerator in the quadrupole gradient and 
depends on the magnetic properties of the ring. If this ring has periodicity L, then so does 
the function k(s), 

k(s + L) = k(s). (5.48) 


Hence we can expect a kind of quasi-harmonic oscillation, where the frequency and ampli- 
tude depend on the location in the ring and show periodicity similar to that of the function 
k(s). All this means is that we can now follow the motion of particles through our beamline 
in terms of the transverse coordinates as a function of distance through the machine. 

The Courant-Snyder formalism, the subject of this section, solves Hill’s equations with an 
ansatz based on this intuition and parameterises the beam motion into a neat formalism. It 
also leads to a macroscopic description of the beam and the famous (6-function of accelerator 
lattice design. We assume a solution of Hill’s equation inspired by our intuition about the 
position-dependent amplitude and phase, namely 


x(s) = \/2AB(s) cos(a(s) + Yo). (5.49) 


This initially looks strange, so let’s pick it apart. 6(s) has the physical meaning of an am- 
plitude of the motion, which depends on the position s around the accelerator. (s) is a 
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position-dependent phase appearing inside our oscillating function and A is an overall con- 
stant. Because Hill’s equation is linear, the constant does not appear in it. We’ll see later 
that A is special and is called the single-particle emittance. We note that emittance is gener- 
ally used as a quantity describing entire beams and we, at the moment, are concerned with 
single particles. The factor of 2 is for later convenience in the definition of the emittance. We 
choose to use ‘single-particle emittance’ as opposed to ‘action’, to avoid confusion with more 
formal treatments involving Hamilton’s equations. Our treatment is not rigorous enough to 
use the word ‘action’ and we hope readers with knowledge of the action will forgive us; 
Andy Wolski’s textbook gives more details [1]. Our ansatz is, in essence, a parameterisation 
of the anharmonic motion of our particle, with a maximum amplitude that changes with s 
through the machine. We use 8 here to mean the (-function, and whilst the same symbol 
is used for the relative velocity 6 = v/c, it is usually clear from the context which quantity 
is being referred to. 

The variable 8(s) is the key quantity in the Courant-Snyder formalism and has many 
names: the ‘beta function’, the beam envelope function, the Courant-Snyder 6-function, the 
amplitude function and so on. It is always chosen to be positive. We’ll see that it represents 
the focusing properties of a lattice, and a small 6-function means a tightly-focused lattice. 
The periodicity of the magnetic system is very important, and this will mean 


B(s + L) = B(s), (5.50) 


for some periodicity L. So the -function follows the repeating structure of the beamline 
focusing elements. 

If we take the derivatives of the Courant-Snyder ansatz and substitute into the equation 
of motion, we find we get two terms: one proportional to cosine and one proportional to 
sine. This is a good exercise to do and we shall leave this to the reader. The coefficients of 
these terms must vanish separately, and we eventually obtain two differential equations 


Lge" — 58) — Bw? + Bk =0 (5.51) 
and 
Bw" + BY" = (BWY =0. (5.52) 


The second equation can be integrated immediately and, choosing a constant of integration, 
gives 


By’ =1. (5.53) 
Now we have an equation for the phase function 
$ ds 
p(s) = =. 5.54 
(s) BO (5.54) 


This position-dependent phase (known as the phase advance) is related to an integration of 
the 6-function along the beam line, and knowing the 6-function means we can compute the 
phase function. We can now eliminate the phase function from the first of the differential 
equations to get a differential equation for the 6-function 


1 1 

5 BB" — r +8k=0. (5.55) 
So knowing the distribution of focusing strengths along a beam line determines (s), al- 
though we rarely (i.e. never under normal circumstances) solve this equation in practice. 
Finally, we define the two functions (with (s), called the lattice functions), 


(5.56) 
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and j 
(8) = 1+ a(s)" 
B(s) 
Once the $-function is known, and hence a and y, the motion of a single particle is 
completely specified by specifying the single-particle emittance and the initial phase factor 
of the particle. So we have 


a(s) = y2Aß(s)cos(Y(s) + yo) 
2A : 
“(s) = — aS) [a(s) cos(w(s) + Yo) + sin(y(s) + yo)], (5.58) 
where the second equation is the derivative of the first. We can combine these two equations 
to give the quantity 


(5.57) 


Ba! + ax = —\/2AB(s) sin(v(s) + vo), (5.59) 
which means we can write an expression which is invariant for a particle 
x? + (Bx' + ax)? = 2AB. (5.60) 


By expanding the bracket and using the definitions of a and 8 we obtain 
yz? + 2axax! + px? = 2A. (5.61) 


This is a very important equation, so let’s look at it carefully. For every point in the 
accelerator we have a value of the functions a(s), G(s) and y(s). They depend on the lattice 
(through the focusing function k(s)) and are different for every point. At a particular point, 
if we combine the particle position and angle with these lattice functions we get an invariant, 
which was the single-particle emittance A we first saw in the solution to Hill’s equations in 
the Courant-Snyder formalism. As the particle moves to the next location in the accelerator, 
where we have different lattice functions, the particle has a different position and angle. 
However if we form this combination of quantities again, Equation 5.61, at the new location 
we get the same value as before. In other words, the single-particle emittance is a constant 
of the motion and always has the same value at every point. 

You may have seen this equation before in geometry. If not, imagine there was no gg’ 
term. What would it look like in the (x, x’) plane? It would be a circle, with equation 


yx? + Ba’? = 2A. (5.62) 


The conserved quantity 
ya? +2arz' + pr? =2A (5.63) 


actually describes an ellipse in the (x, x’) plane, with ellipse parameters described by the 
values of a, 8 and y. 8 controls the extent along the x-axis, y controls the extent along the 
x’ axis and a tells you how upright the ellipse is. The area of the ellipse is given by 


area = 712A, (5.64) 


so the area transcribed by the particle as it moves in (a, 2’) space is constant, since A is a 
constant. In general an ellipse may be described in the (x,y) plane as 


ca” + ery + c3y” = ca, (5.65) 


with area mc4/./c1c3 — c2. For our ellipse in the (x,2’) plane, we can find the points of 
intersection by setting x = 0 or 2’ = 0 and obtain 


A ,_ JA 
s= x - 4 (5.66) 
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The maximum values of x and x’ as the particle moves around the ellipse can be found by 
rearranging and differentiating, to obtain 


Zextreme = HV AB 
Ae, (5.67) 


Therefore, for a fixed A, the parameter G(s) controls the size of the particle’s excursions 
in position space, and the parameter y(s) controls the size of the particle’s excursions in 
angular space. When we come to talk about beams, which are collections of particles, we’ll 
see they are measures of the beam’s spatial size and angular divergence. When one is small, 
the other must be big and vice versa, since they are intrinsically linked. 

If you recall, the lattice parameters are functions of the focusing of the lattice so every 
point in the lattice has a value of the lattice functions. Hence every point in the lattice 
has its own orientation of the ellipse. A given particle has its own value of the single- 
particle emittance, thus setting the area of the ellipse it moves around. To see what is going 
on clearly, let’s play a mind game: we sit at one location in the ring and watch a single 
particle, turn after turn after turn. So every time the particle comes past us we write down 
its position and angle. This can be done with a simple computer code, and we generate the 
coordinates of the particle turn after turn, 


(ti 24); (z2, £3), (z3, £3), (z4, £4) (5.68) 


f 
Textreme 


where (x;,x;) are the particle coordinates on the ith turn. All of these points lie on the 
perimeter of the ellipse, as they must since our fixed point has fixed values of a, 6 and y, 
and A is invariant. Note the particle jumps around the ellipse and does not move around 
it continuously. If you wrote the tracking code in the previous section you could try this 
exercise for some stable lattice. 

The 6-function is a key quantity in the Courant-Snyder formalism. By definition we take 
B to be a position function of position s in the machine, and it carries the same periodicity 
that the lattice itself carries. It is determined by the focusing properties of the lattice, and is 
a function which is routinely computed in the design and operation of particle accelerators. 
It is maximum in a focusing quadrupole and minimum in a defocusing quadrupole. Let us 
now look at some examples. 

The -functions in each plane of the long straight section of the Large Hadron Collider 
are shown in Figure 5.11. We can see the periodic solution in the arc, and the small 8- 
functions in the middle of the plot (usually denoted §* in colliders), which correspond to the 
interaction point where collisions take place. We shall discuss the mini-beta principle soon. 
Note the large 6-function spikes, which correspond to large particle excursion. The section 
which smoothly joins the arc $-function to the minimum is called the matching section. 
And we can measure the 6-function too. Generally in science we can measure quantities 
if we change something they depend on in a systematic way. Hence careful changes of 
quadrupole currents allow 6-functions to be reconstructed. We shall discuss this more later. 
So we have a formalism for linear beam motion in terms of the Courant-Snyder parameters. 
These quantities are central to linear beam dynamics and are used to design accelerators all 
around the world. Let’s study them some more. It turns out that it is possible to write the 
transfer matrix between two points in a lattice in terms of the Courant-Snyder parameters 
at each of the two points and the phase advance between the points. 

Let us now write this general transfer matrix. To begin with, we return to the Courant- 
Snyder form of the solution to Hill’s equation; note that it depends on two constants and 
write this ansatz in a slightly different form, 


x(s) = cı y B(s) cos y(s) + c2 y B(s) sin w(s), (5.69) 
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FIGURE 5.11 The LHC -functions in the long straight section, 600 m either side of the interaction 
point. At s = O we have the tight focusing of the interaction point, and the large @-functions in the 
quadrupoles around this point arise from strong focusing. The periodic 3-functions in the periodic arc can 


be seen at large values of s . 


where cı and cz are constants yet to be determined. If we define the initial conditions at the 
point ‘0’ to be 8(0) = 8o, a(0) = ag and (0) = wo and write the initial particle coordinates 
to be zo and xj then we can fix the unknown constants to be 


C = ale 
1 af Bo’ 
OQ = V Box' + aaoi To. (5.70) 


vBo 


We see the expression for x(s) is linear in zp and 2x9, 


x(s) = [cos (s) + ao sin Y(s)] zo + yv Bob(s)xo sin (s). (5.71) 


Taking the derivative of this expression, we can cast this equation into a convenient matrix 


form as it’s linear, to get 
x x 
(2) =m (2 ) (5.72) 
$1 so 


2 (cose + ag sin Y) V 6189 sin Y 


A071 


TIT cosy ET sin Y & (cos Y — asin Y) 


where we have 


M(s1|80) = (5.73) 


The subscripts 0 and 1 refer to the beginning and end of the transfer map and w is (s1) — 
w(so). This means the transfer matrix between two points is purely determined by the lattice 
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functions at each point and the phase advance between the points. This is a remarkable 
and very useful result. 

The one-turn (strictly one-period) map is a very important quantity. Starting with our 
expression for the transfer matrix between two points, M(s1|so), we observe that the map 
for one turn of the ring means we come back to the same position. Hence we get 


Bi=Po = $ 
Qi = 01 = a 
p= = Y (5.74) 
and we have 
_ f cos Y + asin Y sin Y 
ae he) ( —ysin Vv cos VW — asin V ) , ea) 


where the phase advance over the period is V = Yı — Wo. This describes the transformation 
over one period of the accelerator lattice, and is called the one-turn, or one-period map. It 
is very important and tells us lots about the beam motion. We shall use it very shortly to 
examine beam motion stability. 

Before we turn our attention to the information contained in the one-turn map, let’s 
figure out how to calculate the lattice functions from it. If we multiply all the matrices for 
all the elements in the ring together, we obtain the total matrix for one turn of the machine 
(again, strictly, one period), which we write as 


M= ( Os. SNe Í (5.76) 


We can get the one-turn phase from the trace of this matrix by comparing it to the form 
we have for the one-turn map in terms of lattice functions, obtaining 


(5.77) 


W = arccos (=m) . 


Note that we get only the w part of 27+ y. We can get the lattice functions from the 
other matrix elements as 


™12 
pS sin UW’ 
os ™11 — M22 
2QsinU ’ 
M21 
= — A 5.78 
1 sin Y ( ) 


So we have a route to the lattice functions through the one-turn map. We compute this 
object and this gives the lattice functions at that point. This is how codes such as MADX [5] 
work. 

Note for the phase over the period or turn to be real-valued, and using our expression 
for the phase above, we see that the absolute value of the trace (m11 + M22) of M must be 
equal to or less than 2, ie. | Tr(M) |< 2. 

Imagine we know the one-turn map at one location, say s. Is there a way to figure it 
out at another location, say s’, provided we know the transfer matrix M for s to s’? The 
answer is yes. They are related to each other by a similarity transform, and so 


M(s’ + Cls) = M(s'|s)- M(s + Cls) -M~1(s'|s). (5.79) 
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We shall state this without proof. Similarity transforms come from matrix theory and 
lead to all manner of nice properties such as identical eigenvalues and traces before and 
after the transformation. Let’s be concrete and denote the matrix M (from s to s’) by 


M= ( pe eee J (5.80) 


M21 M22 


Let’s use this to figure out how the lattice functions transform from place to place if we 
know the transfer matrix. Starting with the similarity transform, we can express the one-turn 
maps in terms of the lattice functions at the locations s and s’ as our standard expression, 


M(s'|s) = ( cos Y + asin Y Bsin Y i: 


—ysin Ų cos VW — a sin Y 
Performing the similarity transformation we can obtain expressions for the lattice functions 
at position s = 1, given those at s = 0, as 


(5.81) 


Qı M11M22 + ™M12M21 —MıM2ə1 —Mı2M22 ao 
= 2 2 
By = —2m11Mı2 Mii Miz . Bo . (5.82) 
2 2 
yı —2M21 M22 M31 M322 Yo 


So knowing M, we can transform the lattice functions to any point in the beam line. Needless 
to say, this expression is important and very useful. 

So we know how particles evolve (transform) in accelerator elements. How do the lattice 
parameters transform? Let’s look at a drift space of length L, with an incoming particle 
described by xo and 26, the incoming lattice parameters are 6o, ap and yo, and all quantities 
evolving to position 1. The transfer matrix is 


maiy=( 9 7). (5.83) 


and the particle evolves as xı = xL + zo and x, = z1. Recall we are evolving from s = 0 m 
to S = 1 m. What about the lattice parameters? They evolve as 


Bı = bo- 2L + yL? 
Q1 = ao — yol 
y = y (5.84) 


Note the quadratic term in the evolution of 8, which we shall return to soon. 
Several times we have used the phase advance for one turn of a ring (period), 


s+L ds 
v= I _ (5.85) 


i.e. given by an integral over the $-function. We often call the phase advance for one turn 
of a ring the tune (denoted v or Q), and express it in units of 2 x r. 


Y 1 f+ ds 
v=>= i Sa (5.86) 


There is one tune for each plane, including the longitudinal plane, and it’s a very important 
function for beam dynamics. Note we can evaluate the tune at any point in the ring and 
always get the same answer (a property not shared by a, 8 and y). This is because the 
trace is invariant under similarity transformations. 

Note that all our equations in this section assume the beam motion is linear in the trans- 
verse coordinates. With the caveat in mind, we can start to build complicated arrangements 
of magnets. This art is called lattice design. 
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5.5 Lattice Design 


In this section we shall construct our lattices from some basic building blocks, made of the 
magnets we have met so far. The art of lattice design for machines around the world works 
in this way, and we shall see how small chunks of magnet layout are created and combined 
to give larger structures. While what we do applies equally to circular and linear machines, 
we often use circular machines as the example, and will point out along the way how linear 
machines are designed. 

Let’s start with the basic bending we need, which means dipoles. Synchrotrons, and 
sometimes high-energy particle colliders, are circular machines, so we need plenty of dipoles 
in the lattice to bend the particles around the tunnel or ring. This creates the design orbit 
of the machine, and the middle of our moving coordinate system. Then, once the design 
orbit is sorted out, we need to design the magnetic lattice, and position the quadrupoles 
and higher-order magnets. This process is called lattice design. Fig 5.7 shows a section of 
an accelerator lattice, showing the arrangement of dipoles and quadrupoles in the lattice — 
lattice design is deciding the placement and strength of these elements. 

Our first task to figure out the geometry of the ring, and define the curved reference orbit 
using a layout of dipole magnets. This forms the fundamental footprint of the machine and 
defines our coordinate system for future analysis. The use of dipoles to form the bending of 
an accelerator is a very important application. In circular machines, the dipoles are needed 
to form the ring geometry and must add to a total bend angle of 27. 

Consider a particle bending through angle d0, with arc length ds and bend radius p, 
such that 0 = ds/p. For a weak bending magnet we can approximate ds as dl, where dl is 
an element of the length of the magnet. The integral over the magnet length gives the total 
bend angle, 

A f Bal 
Bp 


which we need to be 27 for a circular tunnel to get all of the way round the circumference 
of the machine’s footprint. 

For an example, consider the LHC. This is a two-beam circular proton synchrotron at 
CERN, Geneva. Here we have 1232 dipoles, each of 14.3 m length, and each beam has a 
design momentum of 7 TeV/c. The rigidity of this beam was discussed in Chapter 2, where 
we defined the beam rigidity as 


(5.87) 


(Bp) = p/4. (5.88) 


For the LHC we need NIB = 2r - p/q to complete the ring so the required field is 8.3 T, 
which is of course the design strength of the LHC dipoles. 

We should say that dipoles are also very important in linear beam lines, to give the 
right angle of beam delivery. They are also used in a transfer line to form dog-legs and 
chicanes designed to manipulate beams, especially for bunch compression. For example, a 
dog-leg in an electron machine to perform bunch compression. Other uses include removal 
of background particles in a linear collider and separation of beams in the LHC. 

The dipoles are now defined and the basic machine geometry fixed. Now we need to 
concern ourselves with linear beam focusing and the quadrupoles. For this we need the 
principle of alternating gradient. Recall that two quadrupoles of opposite polarity could 
provide focusing in both planes at the same time. This fantastic result is one of our fun- 
damental building blocks — the FODO cell — and allows us to construct periodic, stable 
structures. The FODO cell consists of a horizontally focusing quadrupole (F), a space (O), 
a defocusing quadrupole (D) and a space (O), giving an alternating gradient layout. We 
can repeat the FODO cell to make a FODO channel of arbitrary length. Note the drift 
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space (O) can contain nothing, a bend, some diagnostics, an RF cavity or even an entire 
experiment in some cases. 

To understand the beam dynamics in a FODO cell we need to compute the one period 
map, giving the linear motion over one FODO cell. To do this we simply multiply the 
matrices of the components of the cell together, conventionally starting in the middle of one 
of the quadrupoles, which means we start and end with a quadrupole matrix of half strength. 
This is not so strange and ultimately means we find the maximum and minimum -function 
points, which occur at symmetry points of the cell in the middle of each quadrupole. 

Recall our matrices describing the action of the linear accelerator elements on the beam. 
For the focusing quadrupole we had 


= cos(V K's) -L sin(VKs) 
ae ( -VKsin(vV Ks) ee O89) 
and for the defocusing quadrupole we had 
ae ( VED) px EY | (590) 
+4/|K[sinh(.//K|s) — cosh(./|K]s) 


We also need the matrix for a drift, 


1 L 
Marit = ( 0 1 ) f (5.91) 


and the matrix for a thin-lens quadrupole, 


1 0 
Mihin = ( i 4 ) ; (5.92) 
f 
Using the thick lens elements, we can multiply these matrices in sequence, in ‘FODO’ 


order, 
Mrono = Mp/2° Marite © Mp - Maritt © Mp2. (5.93) 


Let’s take some real numbers for a real machine to give a feeling for quantities. Let 
us take our quadrupole strengths to be K = +0.54102 m~?, the quadrupole lengths to be 
1, = 0.5 m and the separation distance to be L = 2.5 m. This gives, if we do the maths 


0.707 8.206 j: (5.94) 


Mrono = ( —0.061 0.707 


This is the one-period map of the FODO cell, and contains a lot of information on the 
stability and the focusing properties of our lattice. First of all, we can ask if the FODO cell 
stable? For this we need the (absolute) trace of the one-turn map to be less than or equal 
to 2. Here it is 1.415. So this FODO cell will give stable dynamics in this plane and the 
particle motion is bounded. Make sure you can repeat these calculations. 

What is the betatron phase advance per cell? Recall that 


(5.95) 


Mii + M22 
W = arccos | ——— ], 


2 


and so the phase advance per cell is 45°. This is a 45° cell. 
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What are the lattice functions at the point of the one-turn map? For us, this is in the 
middle of the focusing quadrupole. Well, we use 


™12 
ee sin U’ 
oe eS Mii — M22 
2 sin Ų 
M21 
= ———— 5.96 
1 sin UV’ ( ) 


and find that 8 = 11.611 m and a = 0. (For this case, what does the ellipse look like?) 
What does MAD compute? Try it yourself (www.cern.ch/mad)! Or construct your own code. 

We can also make our life easier and compute the matrix for our FODO cell using the 
thin-lens matrices. Again, starting from the middle of Qr we have 


tome (4 CG NG DGD) om 


Doing the mathematics, we end up with the matrix in terms of L and f 


1- ža a) 
2 ] 


(5.98) 
aly) l-ap 


Mrono = ( 


which contains lots of information. Straight away we can ask, for what parameters is the 
FODO cell going to give stable motion? This means 


L 
|TM) |s M |fl2 5- (5.99) 
We can also write the cell phase advance in terms of the parameters 
xi 1 r A L? 
cos VW = zTM) =1- 2 (5.100) 


Our stability equation seems to say motion is stable when focusing is weak (long focal 
lengths)! Strong quadrupoles aren’t always the way to go to get stable motion and controlled 
ß-functions. 

Now we can compute the lattice functions in the cell. Note that 6 in the focusing and 
defocusing quadrupoles are maximised there, and this maximum depends solely on the cell 
length and phase advance. Using 


_ M12 
B= sin VU’ 
Mii — M22 
aa oo 5.101 
I=- snt am 
we obtain in the focusing quadrupole 
2L(1 + L/2f) 

= see = 0. 102 
Br sin U » ap=N goat?) 


The expression for Bp can be obtained in a similar way. 

So we build our ring out of dipoles and FODO cells. What about an experiment or a 
region free of magnets for diagnostics? We need to stop focusing for a while, so we should ask 
what will happen? Remember we derived the expression for the evolution of the -function 
in a drift as 

By = Bo — 2ao L + yL’, (5.103) 
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showing what happens to our beta function in a drift. At a symmetry point a = 0 and 
y = 1/6 giving the increase of the 6-function after the symmetry point as 


Z 
Be? 
where we denote the -function at the symmetry point as 6*. This is very bad for accelerator 
designers! What happens can be understood in terms of the ellipse. The area of the ellipse 
is constant, so squeezing @ means we increase y, so the beam rapidly diverges after it leaves 
the symmetry point. This is an example of Liouville’s theorem, which states that the the 
area occupied by a beam in phase space is constant as it moves through the accelerator. 
We saw our ellipse area was constant, which is a consequence of Liouville’s idea. 

Fig 5.11 is the region around the ATLAS experiment in the LHC. Here we have a waist 
(a minimum in the 8-function) at the ATLAS interaction point at s = 0 m, and sitting at 
s = + 22 m are strong quadrupoles (in fact a triplet of quadrupoles) to make the beam 
waist. Around these we have matching quadrupoles to match the (-function back into the 
periodic solution in the LHC arc FODO cells. A problem with large $-functions in the 
triplet quadrupole is that this implies a large aperture requirement due to large beam sizes, 
as well as other problems we shall see later. 

Accelerator design starts with defining the geometry of the machine, attending to the 
dipoles and then using a linear approximation to analyse the linear dynamics. For a modern 
accelerator the machine parameters are defined, with many constraints such as cost, where 
tunnels can go, what the accelerator is specified to achieve and so on. The gross parameters 
such as energy, luminosity, radiation output, etc. are defined. This is the first step and 
defines the global properties of the accelerator and its broad aims. At this stage the user 
community should be involved to ensure the machine will meet the user need. 

Next, we need to consider magnetic technology to define the maximum dipole and 
quadrupole strengths. This defines the geometry of the machine. Then the linear lattice 
is then constructed based on the fundamental building blocks. The linear lattice should 
fulfill the accelerator physics criteria and provide global quantities such as circumference, 
emittance, betatron tunes, magnet strengths, and some other machine parameters. Design 
codes such as MADX [5], ASTRA [6] and GPT [7, 8] (the list is nearly endless!) are used for the 
determination or matching of lattice functions and parameter calculations. Periodic cells are 
needed in a circular machine. The cell can be the kind we have looked at, namely FODO, 
or many others we can come to later after we have discussed dispersion. Next combined- 
function or separated-function magnets are selected and matching or insertion sections are 
introduced to get the desired machine functions in an experimental region or an undulator, 
for example. There is more to do, but we shall come back to this recipe for accelerator 
design once we’ve learned some new concepts. 

As an aside, let’s consider an open-ended design problem. Design a lattice with four 
identical FODO cells, with each cell containing a thin lens, a dipole, another thin lens and 
another dipole. The machine should store protons with a total energy of 3 GeV per proton, 
with a bend radius of around 80 m. Choose suitable values for the quadrupole strengths, 
drift lengths and bending radii so that the motion is stable. Plot the beta functions and 
dispersion in both planes and calculate the ring tunes. What is the momentum compaction 
factor? You could use MAD, or any other suitable code or programming language. 

We shall finish this section with one more matrix we need to know. So far we’ve defined 
our linear matrix formalism and figured out matrices for drift spaces, quadrupoles and 
dipoles. The latter matrix is a purely focusing effect in the plane of the bending, with the 
bending effect of the dipole contained in the co-moving coordinate system. This means the 
beam changes in angle when it passes through the dipole. When we come to build this 
magnet, we have a choice to make. The first choice is called a sector dipole, where the beam 


B(s) = B* + (5.104) 
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is perpendicular to the entrance and exit faces and the magnet follows the curved trajectory 
of the reference orbit. A second choice, which is easier to fabricate, is a rectangular dipole 
where the entrance and exit faces are parallel to each other and so the curved trajectory 
of the beam makes an angle with the entrance and exit faces, normally written as % in 
the literature. The impact of this entrance and exit angle is that the beam receives a small 
focusing kick, essentially due to the larger or lesser amount of magnetic field seen by the 
beam. This is called edge focusing. The particle is bent through an angle of 


zo tan Y 


Aé = 
R 3 


(5.105) 


where R is the bend radius of the dipole and xg is the transverse position of the particle 
under consideration. Hence we can write the coordinate transformation as 7 = x9 and 


T = To + To R’ (5.106) 
or as a matrix as 
1 0 
Meage = ( tany 1 ) . (5.107) 
In the vertical plane there is also a focusing effect, which is given by 
1 0 
Medge = ( tan ) : (5.108) 
-a 1 


So we see a positive ~ causes horizontal defocusing and vertical focusing. 

We should mention at this time that magnetic fields extend beyond the physical extent 
of the magnet with a non-linear character that is not fully included in the effective length. 
These fringe fields can disturb the beam in strong magnets and can be very important to 
the dynamics. For full details see [9, 10, 11]. 


5.6 Errors and Misalignments 


In our analysis we started with an arbitrary field and made an expansion, which we called 
the multipole expansion. When we used the expansion in our discussion of Hill’s equation, 
we only kept the first two terms, equivalent to the constant and linear terms in a Taylor 
series. These correspond to dipole and quadrupole fields. The complete multipole expansion 
for the transverse fields looks like 


By+iB, = X Onz! 


Í 

M 
2 
SS 
+ 
= 
3 


as we saw in Chapter 4. Here, Cn are the multipole coefficients. In this expression we have 
our dipole and quadrupole fields, plus higher-order terms like the sextupole, octupole and so 
on. Linear beam dynamics is the study of the dipole and quadrupole terms, and non-linear 
dynamics is the domain of the higher-order terms. 

To realise these fields in a real accelerator we build the magnets which present the 
required multipoles to the beam, and to do this we need to specify some field quality. These 
magnets will never be perfect and the design and construction of them will lead to multipole 
coefficients different from the design values and the addition of extra multipoles within the 
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FIGURE 5.12 Closed-orbit distortion from a single dipole kick, showing the change in the motion of 
the particle after the kick to a new closed orbit about the ring. 


constraints from symmetry and fabrication tolerances. Therefore these magnets will have 
mostly the field component we want, but will have small contributions of higher-order field 
components. This is what we saw in the chapter on magnets. What do these unwanted 
multipole terms do to the beam? In the design process, most synchrotrons specify a field 
quality of one to ten parts in 10,000, which may not seem much but can cause many problems 
for our beam. We need to be able to calculate the resulting motion. We also need to align 
magnets correctly, otherwise this will lead to further unwanted effects on the beam. For 
example, a quadrupole can be misaligned either horizontally or vertically, so generating an 
additional dipole field in the beam. A further origin for unwanted fields on the beam is 
from a power supply to a dipole or a quadrupole that may vary over time, thus producing 
a field which is not precisely what is required. The bottom line is our lattice is never as we 
designed it and we need to deal with field errors for every machine we attempt to build and 
operate. 


5.6.1 Closed-Orbit Distortion 


The design orbit defined by all of the dipoles in the ring is also known variously as the 
reference orbit or reference trajectory. For an ideal machine this is the trajectory that goes 
through the middle of each magnet, closes upon itself in a circular machine, and is the 
reference orbit to which we define the particle coordinates. This is the desired situation but 
is not achieved in real rings. If there is a small dipole kick at some location, arising from 
any of the reasons we have just discussed like a quadrupole misalignment or a power supply 
error, the beam will feel an extra kick and this orbit will distort. This distortion will run 
around the entire ring or along the entire beamline. This is shown in Figure 5.12, where we 
see the orbit change resulting from a kick at a fixed location, denoted “kick location”. An 
important consequence of this is that a small kick at some location will be seen everywhere 
in the beamline or ring! 

This closed-orbit distortion defines a position-dependent orbit offset around the ring, 
which can be seen in the figure. In effect, the particles no longer oscillate around the design 
orbit but around a new closed orbit, meaning the particles oscillate not about the middle 
of every magnet but some other orbit x(s), where 


x(s) = £g(s) + &co(s). (5.109) 


Here xg(s) denotes our betatronic oscillations around the ideal orbit and rco(s) denotes 
the position-dependent shift of this reference trajectory. This new orbit xco(s) must obey 
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the periodicity of the ring. How do we find it? Well, a short but not terribly enlightening 
calculation involving inclusion of a kick in our formalism gives 


zoo(s) = 0Y EOFS) cos ary — (s) — v(s0))). (5.110) 

sin TV 
where a dipole kick of angle 6 is located at location sp. The betatronic phase at s is denoted 
p(s) and the tune of the machine is denoted by v. 

Note that our expression for the closed-orbit distortion has a denominator containing 
the sine of the tune multiplied by 7. This means that if the tune is an integer, the argument 
of the sine becomes a multiple of 7, and so this factor diverges and gets very big. This 
means the closed-orbit distortion, proportional to this quantity, gets very large. This is an 
example of resonance, where the machine tune is such that harmful beam behaviour occurs. 

Let’s think physically what that means. Imagine the tune was set to 2 in a machine, 
meaning the beam made one complete betatron oscillation every turn of the machine. Then 
the particle would encounter a particular dipole error at one point in the machine every turn, 
and at the same point in its betatron oscillation. This means the effect of the dipole error 
adds up turn after turn after turn, pushing particles to very large excursions transversely. 
This is clearly bad. We avoid this by minimising magnetic dipole errors and staying away 
from dangerous values of the tune. Here, because of our dipole error, we should avoid integer 
tune values. 

We’ll soon see there are many other resonances which occur at other tune values. We’re 
about to analyse the next kind of magnetic error — quadrupole errors — which will mean we 
need to stay away from half-integer tune values to avoid harmful behaviour. 

It will turn out that generally resonances occur when the tunes of the machine in both 
planes satisfy the condition 
MVz + Vy = p, (5.111) 


where m,n, p are integers. This contains the condition for our dangerous integer and half 
integer tune values, and much more besides. The order of the resonance is given by m +n. 
Note that this condition not only includes constraints on either the horizontal or vertical 
tune to avoid resonance, but also resonance conditions that mix the horizontal and vertical 
tune. These are called coupling resonances, with the lowest-order coupling resonance having 
the condition 

Vy £ Vy = pP, (5.112) 


and this is known as the linear coupling resonance. This resonance, driven by non-linear 
elements, couples both transverse planes together and leads to the exchange of motion and 
beam emittance between the planes. Further terminology is a sum resonance, which is a 
positive sign between v, and vy in Equation 5.111, and a difference resonance, which is a 
negative sign between v, and vy in Equation 5.111. A structural resonance is the case of 
the integer p corresponding to the superperiodicity of the machine, as these resonances are 
especially strongly driven and hence dangerous. 

A very common plot is the resonance condition plot, where we plot all the resonance 
conditions on a plot of (vz, Vy). This plot is shown in Figure 5.13. Each condition corresponds 
to a line on the plot, at some order. The tunes of the machine in each plane are chosen 
to avoid these resonance lines, and this tune point is known as the working point of the 
machine. 
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FIGURE 5.13 Diagram of the resonance conditions of a circular machine. The lines correspond to the 
solutions of the resonance condition described in the text and represent harmful conditions for the beam. 


The lines are plotted in the space of (Vz, Vy). 


5.6.2 A Quadrupole Error 


Now imagine we had an extra quadrupole in our ring, or a quadrupole field error. This 
would change the focusing, so every quantity associated with focusing will change. This will 
perturb the beam away from the design, and cause: 


1. change in the tune of the machine, and 
2. change in the 6-function of the machine all round the ring, known as (-beat. 


Let’s calculate it. Imagine our quadrupole error had integrated strength KL = +q. This 
means it has a matrix which kicks the x’ of the particle and looks like 


( = : i (5.113) 


If we represent the rest of the machine by the one-turn map, 


(5.114) 


MGR Days ( cos Ų + asin Y Bsin Y ] 


—ysin Y cos Ų — asin V 


then the effect on the global dynamics of the machine can be calculated from the matrix 
product to give a new one turn map 


_ ( cos Y + asin Y sin Y 1 0 
Miso I80) = ( —ysin Ų cos V — asin Y ) ( —q 1 ) i (8-115) 
This gives, if we spend five minutes doing the matrix multiplication, 
_ cos 27v + ao sin 27v — qbo sin 2rv Bo sin 2rv 
M(s0 + L|so) = ( — 0 sin 2rv — q(cos27v — agsin27v) cos2rv — ag sin 2rv 


(5.116) 
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This is the perturbed one-turn map, and all symbols with a 0 subscript represent the values 
of the unperturbed machine. If we denote the tune and lattice functions of the perturbed 
machine by a subscript p, then the one-turn map looks like 


COA E COS 2TVp + Asin 2TVy p sin 2nvp (5.117) 
8 ee —ysin 27, cos 2TVp — asin 27, f 
Equating the traces of these two matrices gives 
2 cos 2TVp = 2 cos 2rv — qbo sin 2rv (5.118) 


which relates the unperturbed and perturbed tune. We note if q is small, then the perturbed 
tune is close to the unperturbed tune, so v ~ vp. Let’s assume the tune shift is small, and 
write Vp = v + dv. If we then expand the cosine function using a standard identity 


2cos 2a(v + dv) = 2 cos 2rv - cos 2mdv — 2sin 27v - sin 2rdv. (5.119) 


To simplify this expression we recall the quadrupole error in our lattice is small and so the 
tune shift dv is small. Hence we can assume that cos 2adv ~ 1 and sin 2mdv ~ 27dv, giving 
the important result 


qbo = 4rdv, (5.120) 
and so the tune shift from a small quadrupole of strength q is 
qo 
Av = ú) -v = I (5.121) 


This is a very important result. Note the following important features: 


1. The perturbed tune increases if q > 0, which corresponds to a focusing quadrupole i.e. 
more focusing means more oscillations. So we get a positive tune shift for increased 
particle focusing. 


2. This means a pure quadrupole field error would shift the tune one way in one plane and 
the other way in the other plane. However, note that we can also get tune shifts from 
space charge, beam-beam effects and electron clouds, which can cause same-sign tune 
shift in both planes. 


3. The effect of the quadrupole error is proportional to the local (6-function. This is a 
common feature that the -function magnifies local field errors. 


If we have a distribution of quadrupole errors around the ring, k(s), the approximate 
tune shift can be calculated from 


Av = Tf Assis). (5.122) 
4T 

We note this can also be used to measure the 6-functions. To do this, we vary a single 
quadrupole in the ring, and measure the tune, as the response of the beam is proportional 
to the 6-function. In general, the -function tells you how sensitive the beam is to pertur- 
bations. For example, for LHC luminosity upgrades, we may have to live with very large 
-functions in the arcs of the LHC. This means the proton beams will be more sensitive to 
field errors. 

What about the change in beta function due to our quadrupole error q at sg? Skipping 
the derivation (which is short and not particularly enlightening), we obtain 


A 
: j EO cos(2rv + 2\%(s) — %(so)]). (5.123) 
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Note the 6-perturbation is a function of s, so is a ‘beta wave’ around the ring The distortion 
oscillates at twice the betatron frequency, which is why it’s called a 6-beat. Note also the 
strength of the distortion is proportional to the quadrupole error and also the beta function 
at the position of the quadrupole error. The 6-beat measured in the LHC is shown in 
Figure 5.14. 

Finally, we have a sin 27v term in the denominator. This means the expression will get 
very large whenever the tune approaches a half-integer. This is resonance again, and means 
large particle amplitudes are driven for half-integer machine tunes. 


5.7 Off-Momentum Particles 


5.7.1 General Considerations 


So far we have considered beam motion when the particles have the design momentum p. We 
refer to these particles as on-momentum particles, and this defines the ideal, on-momentum 
motion. However, in general, a particle’s momentum will be p + (something small), where 
the beam consists of a group of particles with some distribution of momenta. In fact when 
we come to talk about longitudinal dynamics, it’s a necessary consequence of longitudinal 
stability that we have a range of momenta in the beam. And so we need to think about what 
happens when we have particles which are not at the momentum for which we designed our 
machine. 
So let’s write, for our momentum, 


p+ Ap = p(1 +ô), (5.124) 


where ô = Ap/p and parameterises the deviation of a given particle’s momentum away from 
the design momentum. Now we can explore the consequence of non-zero 6. So how does this 
change our picture? 

Let’s think about dipoles and bending first. Imagine we design our accelerator and 
figure out we need a certain dipole field strength to bend a particle of a certain momentum 
around a bend in the tunnel. This momentum is the one we design our accelerator for and 
so is called the design momentum. We send our design particle with the design momentum 
into this dipole and it bends through the right angle. This defines the machine geometry. 
Now imagine we send through a particle with slightly less momentum. What will happen? 
Well, the dipole field strength of the magnet is fixed so the particle will get a change in 
its angle which is greater than the design particle. Hence the trajectory, or orbit, of this 
off-momentum particle will be different. Imagine we send through a particle with slightly 
more momentum. Now the particle will get a change in its angle which is less than the 
design particle due to the fixed field. Hence the trajectory, or orbit, of this of-/momentum 
particle will also be different. This change in orbit for particles with differing momenta is 
called dispersion because particle beams with a spread of momentum get dispersed by a 
dipole. 

Let’s now consider our particle with some momentum less than the design momentum 
passing through a quadrupole. What happens now? Well, the quadruple is designed to focus 
particles with the design momenta to a single point, and so our particle will see too much 
field, be over-focused and so not be focused to the correct point. Similarly, particles with too 
much momentum will not be focused enough. The quadrupoles, through their k distribution, 
fix the focusing of the lattice and so the -function of the lattice and the tune will change. 
The change in these quantities is said to arise from chromaticity, or momentum-dependent 
focusing. 
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FIGURE 5.14 The -beat in the LHC as measured in 2015 and 2016. In a perfect machine AB/6 = 0 
at all locations. Used with kind permission from [12]. 


196 The Science and Technology of Particle Accelerators 


Let’s be more quantitative. Recall our derivation of Hill’s equation gave us 


" 1l — 
x” (s) + (rc) + =) =0, (5.125) 


in the horizontal plane and 
y” (s) + ky(s) =0 (5.126) 


in the vertical plane. In these equations, earlier in this chapter, we dropped 6 where it 
appeared. We need to figure out how to modify these equations to retain the effects of 
a particle being off-momentum. We do this by retaining terms of the coordinate 6 we 
didn’t previously retain. Let’s explore them one by one. We know that we build accelerator 
beamlines by using magnets, with dipoles and quadrupoles being our basic building blocks, 
and we shall find the retained terms affect all of our elements. We shall also find that our 
particle path length will change for of-momentum particles, an effect known as momentum 
compaction and crucial for the longitudinal motion in our accelerators. We shall consider 
each case one by one. 


5.7.2 Dispersion 


First of all, let’s consider trajectory change, called dispersion. In this case, the derivation of 
Hill’s equations needs to be modified to retain terms in the expansion that are linear in ô, 
which arise from the expansion of the momentum in terms of the momentum deviation ô, 


myvs = p(1 + ô). (5.127) 


In this equation, m is the particle mass, y is the relativistic gamma function and vs is the 
particle speed along the reference trajectory. The complete derivation is left to the reader [3], 
but is straightforward, and if we do this we obtain the revised Hill ’s equations 


1 ô 
x” (s) + (rt) + T) =-, (5.128) 
in the horizontal plane and 
y” (s) + ky(s) =0 (5.129) 


in the vertical plane. Note the vertical plane equation is unmodified as the bending, in our 
analysis, is purely in the horizontal plane. We still have the definitions 


g 1 


kr = s+, 5.130 
Bot p (5.130) 
and the equivalent for the vertical plane, 
g 
ky = — =. .131 


The new Hill’s equation in the horizontal plane is the inhomogeneous equation of motion, 
like before but with a non-zero right-hand side term not containing x or its derivative. This 
is the inhomogeneous term and leads to dispersion. The extra term on the right-hand side, 
proportional to 6, will drive the horizontal motion of an off-momentum particle, which we 
shall call horizontal dispersion, or simply dispersion. Note there is no dispersion-driving 
term in the vertical plane as there is no bending for our derivation. 

The general solution for the horizontal motion of a particle is given by the sum of two 
terms: the betatron motion term xg(s) and an off-momentum dispersion term 


x(s) = xg(s) + za (8). (5.132) 


Single Particle Motion 197 


x, = D(s).6 


FIGURE 5.15 The orbit offset of dispersion, showing the shift from the on-momentum orbit by D(s) 0. 


We can think of xg(s) as a closed-orbit term, around which 2;,(s) oscillates. This follows 
from the theory of differential equations, where we add the special solution obtained from the 
driving term to the homogeneous version of the differential equation. Essentially, dispersion 
is a shift of the closed orbit around which the betatron oscillations occur, as shown in 
Figure 5.15. 

To analyse and understand our dispersive motion it is convenient to define a special 
trajectory, D(s), which is that trajectory followed by a particle that has 6 = 1. This tra- 
jectory, while physical, has no particles following it as they would be lost due to the large 
transverse deviation, but is a tool to parameterise the motion. So let’s consider our newly 
defined dispersion function D(s). This is actually a physically allowed orbit, and the one 
a particle with ô = 1 has should this particle exist. As this D(s) is a physical orbit it is 
focused by the lattice, meaning both dispersion and dispersive motion is focused by the 
lattice. The motion of the particle is the sum of our old x(s) and the dispersion, so that 


x(s) = g(s) +ô- D(s). (5.133) 


One way of viewing this equation is thinking of the dispersive term as a closed orbit around 
the accelerator, and a particle oscillates around this dispersive orbit through the usual 
betatron oscillations. This is like a dipole error closed-orbit distortion. What are typical 
values? Well, xg is typically a few mm, values of D(s) might be < 1 metre, and 6 might 
typically be 0.001. 

So how do we calculate D(s)? We need to find a solution to the inhomogeneous Hill’s 
equation and add it to the general solution of the homogeneous equation, so we need to 
solve 


x" (s) + (rc) + T) = = (5.134) 


To calculate D(s), consider motion in a dipole (so no gradients) and we have 6 = 1 for the 
trajectory corresponding to D(s). Therefore D(s) is a solution of the resulting inhomoge- 
neous equation 

Di (5.135) 
pep , 

We have already solved the homogeneous equation (with the right-hand side equal to 0) as 
this is the matrix we already found for a dipole. Now we need to find a particular solution of 
the inhomogeneous equation and add this solution (Dz) to the solution of the homogeneous 
equation. Since the right-hand side is a constant, then a valid choice of a particular solution 
is a constant: we can try a constant as a solution 


D"(s) + 


Dr=C (5.136) 
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and we can readily find that D = p by inserting this into the equation of motion. This 
means our general solution is 
D(s) = Acos(s/p) + Bsin(s/p) + p, (5.137) 
and its derivative is A B 
D'(s) = —— sin(s/p) + — cos(s/p). (5.138) 
p p 


We can find A and B from the initial conditions by noting that D(s = 0) = Do and 
D'(s = 0) = Dj and so we have equations to evolve the dispersion function D through the 
dipole, which are 


D(s) = D(0)cos(s/p) + D’(0)psin(s/p) + p(1 — cos(s/p)), 


D'(s) = -20 sin(s/p) + D' (0) cos(s/p) + sin(s/p). (5.139) 


These equations are linear and readily written as a matrix equation, 


D(s) cos(s/p) — psin(s/p) p(l — cos(s/p)) D(0) 
D'(s) | =| —5sin(s/p)  cos(s/p) sin(s/p) D'(0) |. (5.140) 
1 0 0 1 1 


Note the upper-left 2 x 2 matrix is just the transfer matrix for a dipole we have already de- 
rived. This means the dispersion function obeys the matrix equations we know already; in a 
dipole, dispersion is also produced (or driven). The dispersion function in a quadrupole obeys 
the quadrupole transfer matrix, and so the dispersion function is focused in a quadrupole 
in the normal way. However, there is no extra dispersion driven in a quadrupole, and so 
M13 and Məs are zero in the matrix. 

Finally, as the motion is given as the sum of the betatron motion and the dispersion 


x(s) = xzg(s) + D(s)ô. (5.141) 


The general motion of a particle can be written as a 3 x 3 matrix equation, 


x(s) Mi Miz D «(0) 
x'(s) = Mo, Moz D' x’ (0) . (5.142) 
ô 0 0 1 ô 
For a short sector dipole with bending angle 0 small compared to 1, 
l 
d= A «l, (5.143) 


we can write this matrix in the simpler form 
1 1 
Ol 0 i (5.144) 
0 0 


This is useful for quick calculations and corresponds to having a thin-lens kick for an off- 
momentum particle. A quadrupole has no driving term for the dispersion and the 3 x 3 map 
is given by 
My, Mi 0 
Mə Mə 0 |. (5.145) 
0 0 1 
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In this expression, the unity factor in the (3,3) element simply expresses the invariance of ô 
(the momentum is unchanged). When we considered '-functions, we looked at the form they 
took in our basic lattice building block — the FODO cell. What happens to the dispersion 
in a FODO cell? Consider a FODO cell with thin-lens quadrupoles. Now that we know 
dispersion is driven by dipoles, we can calculate the dispersion function in the same way 
we computed the 6-function. Let’s find the dispersion at the middle of the F-quadrupole, 
so we have a magnetic arrangement (with B denoting a dipole) 


Z Bg Z, (5.146) 


Looking at only the horizontal motion we find the one-cell map can be constructed from 
half-quadrupole maps, full quadrupole maps, and the thin-lens dipole map. Multiplying 
these five matrices together, 


1 0 0 1 L L8/2 1 0 0 
M= Sor 1 0 0 1 6 F 1 0 |x 
0 0 1 0 0 1 0 0 1 
1 L L0/2 1 0 0 
01 9 Sop 10], (5.147) 
00 1 0 0 1 
we arrive at 
T L L 
Loe i 2L(1 + 27) 2L0(1 + a7), 
M=| -+0-) l- 201-5-) | (5.148) 
0 1 


We have left the matrix multiplication to the dear reader. Here L is the length of each 
dipole, 0 is the bend angle and f is the quadrupole focal length. The upper 2 x 2 was 
obtained before, and now we have information on the dispersion. 

The dispersion in the middle of the focusing quadrupole Dp and its gradient D‘, must 
satisfy the closed-orbit condition, 


Dr Dp 
D, |=M-| DL (5.149) 
1 1 


which leads us to 
LO(1 + $sin $) 

sin? 6/2” 
and D‘, = 0 at the symmetry point in the middle of the quadrupole. The dispersion in the 
middle of the defocusing quadrupole can be found by transforming the dispersion to the 
middle of this quadrupole. 

We’ve seen how to combine alternating gradient quadrupoles to make a focusing struc- 
ture in both planes. This is called the FODO cell and is an example of a basic optical building 
block we use to construct lattices. There are many possible configurations of dipoles and 
quadrupoles that can give stable motion. We can talk about dispersion-free lattices, which 
are important in many applications. These allow bending of the beam without generating 
additional dispersion (known as an achromat). Examples are the Chasman-Green structure, 
triple-bend achromat. We also can build dispersion suppressors, which match the periodic 
dispersion in the arc (perhaps made of FODO cells) into a dispersion-free straight. We can 


Dr= (5.150) 
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also displace the beam transversely without generating dispersion using a sequence of only 
bends. Sometimes called a geometrical achromat. 

Let’s look at achromats in more detail. Consider a simple double-bend achromat (DBA) 
cell with a single quadrupole in the middle of two dipoles that bend in the same direction. 
The role of the quadrupole is to focus the dispersion halfway through the structure and 
allow it to be closed (i.e. set to zero at the dipole exit) by the second dipole. We use the 
thin lens approximation and write down the dispersion matching condition, i.e. we expect 
some dispersion D, in the middle of the quadrupole and feed into the system zero dispersion 


De 1 00 1 L 0 1 L L0/2 0 
o |=|- 10 0 1 0 01 #0 0]. (5.151) 
1 0 0 1 0 0 1 00 1 1 


Here f is quadrupole focal length, 0 and L are the bend parameters and Lı is the distance 
between the quadrupole and bend centres. In essence we match to the D’, = 0 condition 
at the middle of the quadrupole, i.e. the quadrupole turns over the sign of the dispersion 
generated by the bend and the dispersion is a maximum in the quadrupole centre. The 
required focal length is 


f= zı + 1r) (5.152) 


1 
2 
and resulting De is hence 


1 
De = (Lı + 51)6. (5.153) 


Note the dispersion at the quadrupole becomes higher for longer distances and bigger bend 
angles. This analysis shows what is possible, but in practice we need extra quads for match- 
ing and maybe a reduction of the required quad strength by splitting the central quad. 

The optical functions (8,a,Ņy) for a vertical double-bend achromat (DBA) with a 
quadrupole triplet between them are shown in Fig 5.16. Note the bending is done in the 
vertical plane and the structure is achometic in this plane. The horizontal dispersion seen in 
this figure is pre-existing to the achomatic structure. This figure is taken from the lattice of 
the LHeC collider [13, 14] and shows the action of the triplet to focus the vertical dispersion 
from the same-sign dipoles. 


5.7.3 Momentum Compaction 


We have seen that a momentum offset changes the horizontal orbit of a particle through 
dispersion if we have horizontal bending. Ideally, a machine with only horizontal bends 
does not generate any vertical dispersion. However, dispersion does generate a longitudinal 
effect, as the total circumference of an off-momentum particle’s trip around the machine will 
be different to the reference particle. This matters for synchronisation and for longitudinal 
dynamics. What is this circumference, or path length, error? Consider the situation in 
Figure 5.17. The path length in this dipole for the ideal particle is given by p0, and the 
path length for a particle at radius p + x, where x can come from any source, is (p + x)0. 
Hence the path length change due to the particle not being on the design orbit is 


AC = (p + 2)0 — pO = x0. (5.154) 


The change in circumference of the machine, made up of lots of dipoles, is given by an 
integral over the whole ring 


= zco(s) 
ac=¢ ale) ds, (5.155) 
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FIGURE 5.16 A double-bend vertical achromat (DBA) structure from a real lattice, showing the optical 
functions through the structure. The diagram at the top shows the lattice, with blocks for the two vertical 
dipoles and squares above and below the axis for quadrupoles. Note the central defocusing quadrupole 
(square below the axis) turns over the labelled vertical dispersion Dy. Also note the action of the quadrupoles 
on the 8-functions. 


p 

A 

FIGURE 5.17 The origin of momentum compaction, showing the longer orbit travelled at the large 
radius. 
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where we know the closed-orbit distortion around the ring (%co(s)). For the case where the 
closed-orbit distortion is given by a momentum error, we can say 


_ 5 f Pla, 
AC = Fi rane (5.156) 


and so the difference in circumference is proportional to the momentum deviation. Note this 
is because we work with the linear dispersion and in reality the closed-orbit distortion will 
also depend on higher powers of 6. So we define the linear momentum compaction factor 


14C 
c —= =. .1 
a 5C (5.157) 
i AC 
In general, we then have an integral around the ring to compute the momentum compaction 
factor, 
1 [D 
Oe = af (5) as, (5.159) 
CY p(s) 


because a ring has many sources of path length deviation. The momentum compaction factor 
is an important lattice design parameter. A large value means the path length varies a lot for 
offmomentum particles. This means the particles tend to spread out and the bunch length 
becomes long. Similarly, a small value means a shorter bunch length. Typically (D) > 0, so 
the particles tend to orbit on the outer side of the ring. 

In this section we have looked at trajectory changes that depend linearly on the mo- 
mentum deviation, so that x = D(s)d. In general we can have an arbitrary dependence of 
the transverse position on the momentum deviation, and write 


z = D16+ Dod? +... (5.160) 


where Dy, is the linear dispersion (the kind we have discussed in this chapter so far) and 
Dz is called non-linear dispersion, or second-order dispersion. We shall discuss these kinds 
of ideas more in the section on non-linear dynamics shortly. 


5.7.4 Chromaticity 


We have seen that dipoles cause orbit changes to particles due to their spread of momentum. 
This is dispersion. Now let’s think about focusing errors due to these off-momentum parti- 
cles in quadrupoles. Consider some particles of slightly different energy passing through a 
quadrupole, as shown in Figure 5.18. 

Higher-momentum particles have a greater beam rigidity than the reference particle, 
and so are deflected less when passing through a fixed magnetic field. This means focusing 
is momentum-dependent and the particle’s focusing will change with momentum. Similarly, 
a lower-momentum particle will be overfocused by the quadrupole field. This means the 
machine’s -function and tune will depend on momentum deviation. This effect is referred 
to as chromaticity. If the machine tunes depend on the momentum deviation, we can write 
linearly in 6 

Vey = Vx,y(0) + Ezy (5.161) 


where we’ve defined the linear chromaticity for each plane ér y. Non-linear chromaticity is 
an obvious extension, giving shifts to the machine tune dependent on 6 to higher powers. 
This is a topic for a more advanced treatment, but for now it’s good to know of its existence. 
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FIGURE 5.18 Different focusing of a quadrupole lens. The solid ray is the nominal ray, for which the 
quadrupole field is designed. The short-dashed ray is overfocused and the long-dashed ray is underfocused, 


corresponding to too little and too much momentum respectively. 


To analyse linear chromaticity we return to the equations of motion, but this time keeping 
all terms containing x and 6. We proceed in the same way as we’ve done before, but when 
we expand the various terms, we keep the term z. we previously dropped. This generates 
a chromatic term in our equations of motion 


x" (s) + (rato) en :) = : ! (4 a 4) xô (5.162) 


where we defined as usual 


g 1 
kz = — + 5. 5.163 
B F (5.163) 
We can think of these chromatic terms as a quadrupole field error of strength 
AK, = (+E) (5.164) 
x AT f 
A similar analysis in the vertical plane would have found a chromatic perturbation of 
g 
AK, = —ô. 5.165 


We already know how to compute the effect of a quadrupole field error. Recall the tune 
shift from a quadrupole error k(s) in our lattice 


1 
Av = TEOLO] (5.166) 


which means we can write down the tune-shift arising from the chromatic perturbation 
term, 


Av = + f asa(s)(—1) (5 4: £) ô. (5.167) 


This expression is linear in the momentum deviation and gives us the tune shift for 
this focusing error. It is conventional to define the horizontal tune change per unit 6 as the 


horizontal chromaticity 
1 2 g 
E 7E f asa(s) (4 + Z) (5.168) 


We call the chromaticity ‘natural’ as it arises from the quadrupoles which make up the 
lattice. Any lattice with quadrupoles naturally generates this chromaticity. Similarly, in the 
vertical plane, 


= ae faso 2. (5.169) 
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The horizontal 6-function is biggest in horizontally focusing quadrupoles (and vice versa), 
so the natural chromaticity is normally negative in both planes. The linear chromaticity Q’ 
is sometimes written as the linear change in the tune 


AQ = Q'6. (5.170) 
For a FODO cell we can show that 
br — Bp 
m 171 
fp = - P, (5.171) 


which is a very useful expression and is proportional to the difference in 6-functions at the 
F and D quadrupoles. Chromaticity is naturally generated by any focusing lattice, so when 
we have non-zero k we have chromaticity and it tends to be negative in both planes x and 
y. 

The optics of the LHC long straight section were shown in Figure 5.11. The chromaticity 
generated in the strong quadrupoles increases with the 6-function, and so large chromaticity 
is generated in the quadrupoles around the LHC’s interaction point. This is an unavoidable 
consequence of the mini-beta layout. 

The chromaticity number tells us how much the tune shifts for a unit shift in the mo- 
mentum deviation (Ap/p = 1). So given the beam has an energy spread, it tells us the 
spread of the tune of the beam. Tune is a finite region in tune space. If we measure the 
beam’s frequency spectrum by a pick-up device and perform a Fourier analysis, we’ll see 
spikes at the fractional part of the tune, and the width of the spike will give an estimate of 
the chromaticity. 

How do we correct chromaticity? Well, it basically comes about when a particle which 
is slightly off-momentum sees a different quadrupole field than it should and this particle 
is focused differently from the others. So in essence we need a correcting device which has 
some kind of transverse position-dependent focusing. A sextupole! A sextupole field has 
field components given by 


S 
B, = Szy ,By = ae —y’), (5.172) 
where S defines the sextupole strength, d?B,/dx?. Note the field is quadratic in x and 
y, and also (for the first time) we see products of x and y in our equations, known as 
coupling. A sextupole couples the beam planes. An off-momentum particle passing through 


the sextupole has displacement 
x = xg + Do, (5.173) 


with y = yg in the vertical plane. And so the fields seen by the particle are found by 
substitution 


Bz = S(xg + Dô)yg 
= Sxgyg + SDõyg (5.174) 
and 
B, = Pu — y3) + SDéazg + 5 262, (5.175) 
yY 2 B B B 2 


There are many terms here, some helpful and some harmful. The helpful ones for us are 


B; = SDôyg 
B, = SDéz~, (5.176) 
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where the horizontal dispersion function has made each sextupole into a quadrupole with an 
effective gradient S.D.6. We can use these to cancel the natural chromaticity in the lattice 
and cancel the chromatic tune shift. But it’s not all perfect. Remember we ignored plenty 
of terms in the fields of the sextupoles; some of the terms are good and fix our chromaticity, 
but some are bad and introduce non-linearities and coupling into our accelerator ring. These 
terms can harm the beam. It is not possible to represent sextupoles in our linear formalism, 
and often the best way to understand the impact of sextupole fields is to track particles with 
matrices, and stopping to be more careful every time a sextupole is encountered. This leads 
to the study of a machine’s dynamic aperture, or what amplitude of particle can survive 
for many turns. To get stable solutions for the off-momentum particle, we need to put 
sextupole magnets and RF cavities in the lattice beam line. Such nonlinear elements induce 
nonlinear beam dynamics and the dynamic acceptances in the transverse and longitudinal 
planes need to be carefully studied in order to get sufficient tolerance or acceptance (for 
long beam current lifetime and high injection efficiency). For the modern high-performance 
machines, strong sextupole fields to correct high chromaticity have large impact on the 
nonlinear beam dynamics and this is one of the most challenging lattice design issues to 
deal with. In the real machine, there are always imperfections in the accelerator elements. 
So, one also needs to consider engineering and alignment limitations or errors, component 
vibrations, and so on. Correction schemes such as orbit correction and coupling correction 
need to be developed, involving elements such as dipole correctors, skew quadrupoles and 
beam position monitors. 

So, to close, how can we measure the chromaticity? Generally in science we change 
something to measure it and so we change the beam momentum and make a linear fit of 
the tune. For more details see [3]. 


5.8 Beams of Many Particles, and Emittance 


So far we’ve defined the single-particle emittance A (or action) of a particle, 


x(s) = V/2AG(s) cos(w(s) + Yo), (5.177) 


which was the constant in our Courant-Snyder analysis and defines the amplitude of the 
motion. The motion of an individual particle is then completely specified by its single- 
particle emittance A and by its initial phase wo. Different particles will have different single- 
particle emittances and initial phases but they all have the same Courant-Snyder functions, 
at least for beams with no momentum spread. Therefore, each particle has its own invariant 
ellipse, with areas fixed by its value of A, around which the particle slides as it moves through 
the lattice. This is what we saw earlier in this chapter. The particle with z = x’ = 0 has 
zero emittance and always stay at x = x’ = 0. This is called the ‘ideal particle’, and does 
not exist in practice. Before we worry about definitions of whole beam emittance (or simply 
called emittance), let’s think more about single particle motion. Dropping the initial phase 


we get 
x(s) = V2A6(s) cos(¢(s)). (5.178) 


Now recall we wrote the invariant equation as 
x? + (Bx! + ax)? = 2AB, (5.179) 


which can be interpreted in the (x, 8x’ + ax) plane as a circle of radius \/2AB. So if we 
use these coordinates, particles move on circles in this plane. This can be a very useful 
concept. If we have the challenge of representing a beam of particles, containing many 
different values of A, we can now see a possible definition of the overall beam emittance — 
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we choose one circle, corresponding to one particle, to represent the beam, which includes 
a certain fraction of all the particles in the beam. If we transform back to an ellipse, this 
representative ellipse plays the same role. This assumes some distribution of particles in 
phase space, which we shall come back to and the precise analytical distribution depends 
on how the beam is prepared or stored and can often be taken to be a simple analytic form. 

So, in other words, we always have more than one particle in our beam and so, generally, 
need to understand how to characterise a beam of particles, each with their own value of 
A. We can choose one of the particle’s emittances to represent the emittance of the entire 
beam or choose some other number to represent the beam as a whole. For example, 68% 
of all particles, or 95%, or some definition based on the typical value. This one number is 
called beam emittance, or emittance. 

Very often an expression for the emittance based purely on knowledge of the particle 
distribution is useful. As an example, when we make simulations we have access to all the 
positions and angles in the beam and so we can define the RMS emittance as 


€rms = y (x?) (x!2) — (aar’)?, (5.180) 


In this expression we have defined the beam distribution moments as integrals over the 
particle density p(x, x’) as 
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Note that we can write these expressions as sums over a finite number of particles N in a 
form such as 


a) = x ti (5.182) 


for when we deal with numerical representations of particle beams (for example, in a beam 
simulation). 

How does this relate to our definition of the single-particle emittance A? We have defined 
x(s) for a single particle, and so its derivative is (where a is the Courant-Snyder parameter) 


2A 


SSB 


(cos(¢(s)) + asin(y(s))). (5.183) 


We have already observed that this corresponds to the particle moving around an ellipse as 
the coordinates (x, x’) evolve, with the area of the ellipse being specified by A. This motion 
can also be understood in terms of two alternative variables to (x, 2’), namely the size of 
the ellipse the particle moves around, A, and the angle around the ellipse, Y. This is entirely 
equivalent to x and x’. If we use (A,w) to describe the particle, then the transformation 
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linking the two descriptions is 


z y 2AB cos(w) 


go = -J> (cos(w) + asin(y)). (5.184) 


The variables A and w are known as action-angle variables, as we know very well now, is a 
conserved quantity for a given particle. We can expect our beam to have a uniform range 
of values of ~ so the average value of a collection of particles, each with their own value of 
A would be 


II 


(£?) = 28(s)(A cos(w)). (5.185) 


Here the angular brackets mean we average over all particles in the bunch. If all the particle 
angles are randomly distributed and uncorrelated with A, then we can write 


(x°) = B(s)(A) (5.186) 
or, defining (A) = e, 

(x?) = B(s)e. (5.187) 
Similarly, we can use the derivative of our expression for x to obtain (xx’) and (2’”), ob- 
taining 

(xx') = —a(s)e (5.188) 
and 

(x?) = y(s)e. (5.189) 


Combining these we obtain our expression for the beam emittance € in terms of our beam 
moments, 


Ems = y (x?) (x!2) — (aar’)?, (5.190) 


The RMS emittance of a beam is useful because we can simply sum over the coordinates of 
known particles and it coincides with the single-particle emittance of a beam in a circular 
machine (with all particles sitting on an ellipse with the same single-particle emittance). 
Now imagine we had a beam distribution in A and Wọ, which is just a collection of 
particles in our machine. For example, imagine we had a bunch of particles uniformly 
distributed in wo and Gaussian distributed in (x, 2’). The link between A and (a, 2’) is 
given by 
yr? + 2axa' + Ba’? = 2A, (5.191) 


and so we can write for the particle density 


Wigs). = exp(— 


N 

1 yx? + Qaxa! + Bax’? 
Z exp ( ) 
1 


2€rms 


=, tag ( a = ay =) (5.192) 


We can fix the normalisation by requiring 


/ av f dx’ W(x,2')=1 (5.193) 
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to obtain 


1 x? + (ax + Ba’)? 
V(r, 2’) = e OP ( ) : (5.194) 


If we perform the integration for the second moments of the beam distribution, defined as 
above, we now obtain 


(x?) = Berms, 
(xz) = —O€rms; 
(x?) = ‘Y€rms- (5.195) 


To obtain an expression for the average beam emittance in terms of these quantities, we use 
yr? + 2axa' + Ba’? = 2A, (5.196) 
and, taking averages, we obtain 


2(e) a(x?) + 2a(xa’) + Bix!) 


Erms (278 ~ 2a”) 
= Des (5.197) 


II 


And so we find our RMS definition of the emittance to be the same as the average value of 
our single-particle emittances, 
(€) = &ms- (5.198) 


We can show that the ellipse with a single-particle emittance of €rms corresponds to 68% of 
the particles in the beam. This is left as a very instructive exercise for the reader. 

Finally we close this section with the concept of normalised emittance. The beam emit- 
tance we have discussed in this section is also known as the geometric emittance e. If we 
increase the momentum of the beam (i.e. via an acceleration process) then the transverse 
velocities remain constant whilst the longitudinal velocities increase; the emittance of the 
beam reduces x 1/8y where 8 and y are the usual relativistic parameters. This process 
is known as adiabatic damping [3]. It is useful to introduce the normalised emittance €y, 
defined as 

en = Bye. (5.199) 


In the absence of other processes, ey remains constant under acceleration and does not 
depend on the momentum of the beam; in the case of high-energy electrons where 3 ~ 1 to 
a good approximation, we have ey = Ye. 


5.9 Longitudinal Dynamics 


In this section we shall explore some dynamics of the longitudinal plane, focusing on longi- 
tidinal stability from a beam dynamics perspective. The detailed discussion of RF cavities 
and their fields can be found in Chapter 3. 

So far we have studied transverse motion using (x, x’) and (y, y’), so motion has been 4D 
and purely in the transverse planes. Now we need to study the remaining direction, involving 
the coordinates in the longitudinal direction. This is called synchrotron motion, and we 
need to worry about energy gain, longitudinal stability and how we focus in accelerating 
structures. In analogy with our study of transverse motion, we could expect to use s and s’ 
as the longitudinal coordinates, and proceed in much the same way. In fact, instead of s’, 
we use the momentum deviation 6 or the energy deviation. But it makes no fundamental 
difference. 
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FIGURE 5.19 The RF waveform and particles arriving either at the design time in the waveform, early, 
or late. The system is designed so that the design particle sees for zero field and an increase in particle 
energy decreases the arrival time at the next cavity. The filled particle is the synchronous particle, the grey 
particle arrives too late and the unfilled particle arrives too early. 


All accelerators, or at least what are now known as conventional accelerators, use radio- 
frequency cavities to accelerate. These cavities also provide longitudinal stability, as we 
shall see, and this is very important in accelerator design and operation. The RF field 
varies sinusoidally in time, hence only particles arriving at the correct time will get the 
design acceleration. A real bunch of charged particles has a finite bunch width and hence 
some of the particles will arrive too early or too late (possibly due to having too much or too 
little momenta), and hence will experience a different accelerating voltage than the centre 
of the bunch, dependent on the arrival phase of the bunch. In designing a machine we will 
choose the phase at which the particles will arrive at the cavity, known as the synchronous 
phase, @s. 

In the linac, the phase is defined relative to the maximum of a harmonic voltage, and 
so ¢; = 0 corresponds to maximum acceleration (known as being on-crest). This is because 
linacs are generally operated close to the maximum in voltage. A different definition is 
generally used in a circular machine, with ¢, = 0 corresponding to a minimum in the 
harmonic voltage. Hence ¢,=0 provides zero acceleration. 

Taking the linac definition, if the synchronous phase is between 0 and 7/2, then particles 
arriving late will get more acceleration, and late particles will get more acceleration. The 
opposite is true if the synchronous phase is between —7/2 and 0, where early particles will 
get more acceleration. This is shown in Figure 5.19, where the circles represent particles 
arriving at different times or phases, for a synchronous phase of 7/2 in the linac definition. 

Now we come to a very important principle, and one which makes accelerators work — 
the principle of phase stability. In order to achieve stable acceleration we would want the 
time it takes for particles to reach the next RF cavity (or return to the same RF cavity in 
a synchrotron) to be slightly longer for early particles or shorter for late particles. This will 
provide a restoring force to particles towards the bunch centre and ensure that particles do 
not slip in phase so much that they are no longer synchronous with the RF. In the next 
two sections we shall see how we can achieve this for circular machines and then in linear 
machines. 


5.9.1 Longitudinal Dynamics in Circular Machines 


One way of proceeding would be to define longitudinal lattice functions, in perfect analogy 
to our studies of transverse beam dynamics, and this can be done. However, synchrotron 
motion is very slow compared to transverse motion and this approach is not the most natural 
way to do things. However, we can define a synchrotron tune, which turns out to be much 
less than the transverse tunes and we can write 


Vay > Vs, (5.200) 
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where vs is our longitudinal, or synchrotron, tune. Because the motion is so slow, we can 
ignore the s-dependent effects around the ring, and avoid a longitudinal Courant-Snyder 
formalism. 

In circular machines, to get the same magnitude and phase of accelerating field every 
time a particle travels around the ring and returns to the cavity, the RF frequency w has 
to be an integer multiple h of the revolution frequency wo 


WRF = hwo, (5.201) 


so the beam always sees the correct accelerating field and gains the correct amount of energy. 
In these equations h is known as the harmonic number. But what if h is slightly wrong, 
i.e. h=110.0000000001 instead of 110 exactly? Then the next turn the field seen by the 
particle will be slightly different than what is needed. Then, after many turns, the beam 
will be increasingly out of phase with the RF system and will no longer be accelerated. 
We surely need to be tolerant to very small errors in the frequencies as a beam is made 
up of particles with a spread of phases. This was resolved by the very important principle 
of phase stability, discovered independently by Edwin McMillan and Vladimir Veksler in 
1945 [15, 16]. For stable motion we choose our RF frequency, which fixes the synchronous 
particle. Now, particles with slight deviation in longitudinal coordinates will oscillate (albeit 
slowly) around this synchronous particle. 

Our cavity is designed to generate a time-dependent longitudinal electric field to transfer 
energy from this field to the particle, as we discussed in Chapter 3. The RF voltage applied 
to the particle is sinusoidal in time, 


V(t) = Vo sin Wret, (5.202) 


and if we pick the RF frequency to be an integer multiple of the revolution frequency the 
beam sees the same voltage every time it crosses the cavity. This is called synchronism, 
and can be written as wrr = hwo. So now the cavity is set up so that the particle at the 
longitudinal centre of the bunch, called the synchronous particle, acquires just the right 
amount of energy and it sees the same voltage each turn. 


V(t) = Vosin(wrrt + ġo) = Vosin(¢st). (5.203) 


In the case of no acceleration, the synchronous particle has ¢, = 0, and so it sees a zero of 
the harmonically varying voltage. Referring to Figure 5.19, consider now another particle 
arriving at some other phase @. If a particles arrives early, it sees too little voltage, so that 
o < @,, and if a particles arrives late it sees too much voltage, so that ¢ > ¢,. If we want 
to accelerate, we choose 0 < $s < m so that a synchronous particle gains energy on each 
turn of the machine. 

Let’s consider our ring, for which the synchronism condition is fulfilled for a phase @s. 
This could be accelerating or not, as it doesn’t matter here. Consider the sinusoidal RF 
waveform in Figure 5.20. 

The effect of the slope in the voltage function depends on the particles energy. There 
are two effects to consider, the particles velocity and the path the particle takes around the 
ring in the accelerator’s dipole and quadrupole fields. At low particle energies (compared 
to the rest energy) the dominant effect is the change in the particle’s velocity, and hence 
the revolution time, while at high energies where the particle is travelling close to the speed 
of light the change in momentum causes the particle to have a larger or smaller bending 
radius and so the particle with higher energy will take a longer oscillating path around the 
ring. 

First, let’s take the low-energy case. Let eV, be the energy gain in one cavity for the 
particle to reach the next cavity with the same RF phase. The points in (energy, phase) space 
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FIGURE 5.20 The RF waveform, showing stable and unstable fixed points. 


where this happens are called fixed points, here Fı and F>. This is shown in Figure 5.20. 
Imagine a particle arrives a little later than the synchronous particle. So it sees a slightly 
later phase of the RF waveform. This is the point G;. This means it gets a larger energy 
kick, so has a higher velocity and gets around the ring faster. This means it arrives slightly 
earlier than it did, and hence moves towards the fixed point F). 

Similarly, an early particle will see F,, get a smaller kick and move towards F1. Hence 
F is a stable fixed point. Therefore an increase in energy is transferred into an increase in 
speed, hence a quicker time to the next cavity. E; and G will move towards F3. This is 
the principle of phase stability. This means the particles oscillate around the synchronous 
phase, and have a natural spread of momenta. 

Let’s play the same game for the other fixed point Fz. Here, if we follow the same logic 
we find the points Es and Gz move away from F». Hence we call Fh an unstable fixed point. 
So we can classify fixed points as either stable or unstable. 

So this works if an increase in energy becomes a decrease in time to the next cavity. What 
happens if the particle is moving at the speed of light? This means that gaining energy does 
not increase its speed. For this case, higher energy translates into a longer revolution time, 
which means F» becomes a stable point and Ff, becomes an unstable fixed point. Now Ey 
and G, will move away from F, (which is now unstable), while Ey and Gə will go towards 
F (which is now stable). This arises because particles with lower energy move on an inner 
dispersive orbit, with a lower revolution time. 

So, the stability behaviour changes as the particles accelerate and become relativistic 
and this change of behaviour — when F and F> swap between stable and unstable points 
is called transition. 

Let’s look at this more carefully now. Particles with different momenta travel on different 
paths and we know the revolution time T depends on the circumference, C, taken by a 


particle and its speed, v, 


T=", (5.204) 


The fractional revolution frequency for a slightly different circumference and speed is there- 
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fore given by 
Af AT AC Av 
= = ; 5.205 
f T C v ( ) 
What this means is the particle arrival time is affected both by a longer path around the 
machine and also by the particle moving faster. We can relate both these contributions to 
the fractional momentum deviation, 


F 2 n (5.206) 
Here we have defined the phase slippage factor 
1 1 1 
P A P a 


In this discussion we have used the useful equations 


Av 1 Ap 
Ta (5.208) 
y P 
ane AC A 
p 
SF = Ae. 2 
= ; (5.209) 


The quantity yr is called the transition gamma and is related to the momentum compaction 


factor of the lattice through 
1 


Vee 
Below the transition energy we have y < yr and so 7 < 0. So a higher-momentum particle 
has a revolution time shorter than that of the synchronous particle and so makes a single 
turn back to the cavity in a shorter time. This means our fixed point F is stable and F> is 
unstable. 

Above the transition energy we have y > yr and so 7 > 0. Now the opposite is true. 
Higher-momentum particles have a revolution time greater than that of the synchronous 
particle. This means our fixed point F} is unstable and F is stable. At the transition energy 
the machine is isochronous (same revolution time) for all momenta and all particles circulate 
with the same period. This is 7 = 0. The point of transitioning from below transition to 
above transition is a dangerous time for the machine, as longitudinal confinement is briefly 
lost and the RF phase suddenly has to jump from one stable region to another. 

Now that we have stable regions in longitudinal phase space, we can start to study the 
longitudinal dynamics. The derivation of the longitudinal oscillations in a circular machine 
is beyond the scope of this introductory book, but let’s sketch some important ideas. If 
we give the parameters of the cavity we can compute the motion in the longitudinal phase 
space. This is called a phase space portrait. In the transverse plane we used the variables x 
and x’, which made sense for this motion, but in the longitudinal place it’s more common 
to use the particle phase difference from the synchronous phase and the relative energy of 
the particle. We see regular motion around the stable fixed points, and unstable motion 
elsewhere. The dividing line between stable and unstable motion in this plot is known as 
the separatrix, shown in Figure 5.21. The separatrix starts from a point very close to (but 
not exactly at) the unstable fixed point, moves away and forms an ‘alpha’ or fish shape 
around the stable fixed point. The area of stable motion enclosed is called the bucket and 
there is one bucket per RF period. In the LHC the RF system oscillates at 400 MHz, the 
stable regions (buckets) are separated by 2.5 ns and we fill every 10th bucket with protons. 


yr (5.210) 
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FIGURE 5.21 Longitudinal stable and unstable motion, written in terms of the variables (¢, ô). The 
synchronous phase is Øs. The boundary between stable and unstable motion is called the separatrix. 


A complete and quantitative discussion of this topic can be found in [3]. However we can 
learn something by sketching out the analysis. We proceed by looking at energy balance for 
one complete turn of the machine, balancing the energy gained by a particle when arriving 
at the cavity at some varying phase to the energy lost per turn by the particle as it moves 
around the ring. The change in the particle energy is the difference between these two 
quantities and this leads to a first-order differential equation for the rate of change of the 
particle’s energy. We can also obtain a differential equation for the rate of change of the 
particle’s arrival phase turn by turn, obtaining 


. 2 1\ AB 
Aġ = < (a. +). a (5.211) 


In this equation q is the particle charge, To the revolution time and £ is the particles relative 
velocity. It is an equation for the rate of change of the arrival phase w in terms of the energy 
deviation AE/E. What this says is that there are two ways for a particle to pick up a phase 
difference with respect to the synchronous particle, and both are related to the energy error 
with respect to this synchronous particle. The first term arises as the circumference of the 
machine for an off-energy particle is different than the design circumference. We learned 
all about this when we looked at the momentum compaction. The second term comes from 
the fact that an off-energy particle has a different speed than the design speed. Both terms 
are related to the energy deviation and have opposite sign in most cases. The relative size 
of the two terms determines if a machine is below or above transition. If we combine our 
equation for the rate of change of arrival phase with the equation for the change of energy 
we can obtain a second-order differential equation for the energy, and an expression for the 
synchronous frequency. For full details see [3]. 


Longitudinal Dynamics in Linacs 


In linacs each bunch only passes through each cavity once and the bunches are almost always 
being accelerated with lots of RF cavities closely spaced together. Turning our attention 
to a linac, only the particle speed changes matter for longitudinal stability as there are no 
dipoles and hence no momentum compaction. This means that the subject of longitudinal 
dynamics is mostly concerned with proton and ion beams up to a few GeV, and the very 
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start of electron linacs up to a few MeV (in the case of electrons they will become relativistic 
in the first few cells of the first RF cavity they see). The change in particle energy from 
cell-to-cell means that the relativistic 6 of the particle beam changes along the length of 
the linac. Therefore the length of the accelerating cells, numbered 1 to n, increases along 
the length of the linac. The approximate length of cell n+ 1, to have the particles enter this 
cell at the same phase as the previous cell, is given by 


Un Pa dun Ly, Ta 


Qn f dz Anf ’ 


Lany = (5.212) 


where vp is the velocity of the particle entering cell n + 1, and hence leaving cell n, with 
phase advance, a, and f is the frequency. Replacing vn with n, which is the ratio of the 
particle velocity to the speed of light of the particle entering cell n + 1, we can find [17] 


Dnt = y Ta (5.213) 


We may choose to use a longer or shorter cell length to have the synchronous phase vary 
from cell to cell. 

Many linacs that require short relativistic electron bunches will include magnetic chi- 
canes, using four dipole magnets, in order to create a difference in path for high- and 
low-energy particles, such that a beam with a variation in energy along its length will ex- 
perience momentum compaction but only inside the chicane. The velocity gradient, d3/dz, 
is related to the accelerating gradient and chosen as a compromise between peak electric 
fields and the length of the structure to reach 1 MeV. It should be noted that due to rela- 
tivity, d6/dz is not constant for a constant accelerating gradient and decreases to zero as 
the particles gain energy and the particle velocity tends to the speed of light. 

Similarly to circular machines below yr, low-energy electron linacs and low to interme- 
diate energy proton and ion linacs experience longitudinal bunching. In the case of linacs, 
the synchronous phase is chosen in the design of the linac, with the RF cell length cho- 
sen to vary along the linac with increasing particle velocity matched to the acceleration of 
a chosen synchronous particle at the desired design gradient. As mentioned previously a 
particle which arrives early will experience less acceleration and will fall behind, a particle 
arriving late will get more acceleration and catch up. It hence makes sense to again anal- 
yse longitudinal dynamics in terms of the longitudinal phase space discussed previously in 
this chapter, which is a consideration of particle energy versus particle time or phase with 
respect to the synchronous particle’s energy and time/phase for motion in the linac. As in 
circular machines, if we take a particle with a displacement in phase or energy from the 
synchronous particle it will follow a path in longitudinal phase space, with stable particles 
following closed loops and unstable particles following continuous paths that slip from one 
RF cycle to the next. This interface in phase space is again known as the separatrix, shown 
in Figure 5.21. Not every RF bucket will necessarily be filled with particles in a linac as 
there may be other reasons to want to space bunches out in time as we will see in Chapter 
T: 

By analysing the Hamiltonian of the acceleration in a linac (assuming smooth continuous 
acceleration), we can define the maximum deviation from the synchronous particle at the 
synchronous phase and the maximum phase deviation at the synchronous energy while still 
providing stable acceleration [18]. The maximum energy difference, AKmaz, also known as 
the energy acceptance, is given by 


AK meon 2 Face 363 : 
= y a VPA cg cos @s — sin gs), (5.214) 


mec? amc? 


Single Particle Motion 215 


where Face is the accelerating gradient, and A is the wavelength of the RF, and q is the 
particle charge, and the subscript s donates the synchronous particle’s properties, with ¢, 
being the synchronous phase, 7, being the Lorentz factor for the synchronous particle, and 8s 
being the relative velocity of the synchronous particle. This implies that a synchronous phase 
of ¢;=0 gives no energy acceptance and hence would not be suitable choice of synchronous 
phase. For the maximum allowable phase deviation (the phase acceptance), for any given 
synchronous phase, we solve the motion for the case where AKmaz=0 and find two solutions, 
one for either side of the synchronous phase, ¢; and ¢2, for the early and late particles 
respectively. We find one solution is ¢; = —¢, while ¢2 is given by 


sin d2 — Q2 cos ds = sin bs — Qs COS ds. (5.215) 


For small ¢, we find the phase acceptance is from —@, to 2¢,. We find that both the energy 
and phase acceptance of the linac increases with increasing ¢,. However, the accelerating 
gradient decreases as cos¢@, hence we do not want to have ¢, too large. Typically syn- 
chronous phases of around 20° are chosen as a good compromise. The electrons will now 
oscillate with simple harmonic motion with a frequency, w; equal to 


2 = 2 9B acc’ aR : (5.216) 
2nmc?73 Bs 
where wọ is the RF frequency and the amplitude is dependent on the particle’s initial 
deviation from the synchronous particle in energy and phase. As can be seen, the frequency 
of the synchrotron motion in linacs is energy dependent, due to the increase in the Lorentz 
factor, with the frequency decreasing with increasing beam momentum. For sufficiently 
high energies the oscillation period will be longer than the length of time the bunch takes 
to traverse the linac and hence can be neglected. 

Let’s now think a little about transverse dynamics in linacs. We clearly need some kind 
of transverse stability, as we don’t want the accelerating particles drifting off to larger 
transverse position as they accelerate. In Chapter 3 we discussed the accelerating action of 
RF resonant cavities and we see that when a particle in a cavity feels a longitudinal electric 
field, it also feels transverse fields. This means the particle receives transverse momentum 
kicks. Note the kicks felt as the particle enters and leaves the cavity are different as the 
field changes in time as the particle moves through the cavity and vary with radius. To get 
a feel for the necessity of transverse forces, imagine we transform to the rest frame of the 
particle in the cavity and hence only worry about electrostatic forces. These are described 
by Laplace’s equation for the potential V in 2 dimensions, 

2 2 
i oY =0, (5.217) 
dx? dz? 


meaning it is impossible for both the transverse (x) and the longitudinal (z) to be focusing 
(a minimum in V (z, z)) at the same time. For a full analysis of the transverse focusing from 
the changing fields in a cavity see [19, 18]. For longitudinal stability we need a synchronous 
phase off-crest between 0 and 7/2. However, operating at synchronous phases other than 
0° causes the beam to have a transverse voltage component. In a perfect pillbox cavity the 
longitudinal electric field is constant along the length of the cavity, however the introduction 
of the beam-pipes causes the longitudinal electric field to vary along the length. Gauss’s 
law states that if the longitudinal electric field has a longitudinal variation then there must 
also be a radial electric field that varies radially. The radial electric field coupled with the 
azimuthal magnetic field gives rise to a transverse force that is zero on the beam axis (r = 0), 
but due to the finite bunch radius, the edges of the bunch experience the transverse force. 
If the beam is accelerated at an RF phase of 0° (i.e. the electric field is maximum when 
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the beam reaches the halfway point of the cavity), then the transverse force at the first 
half of the cavity exactly cancels the force in the second half of the cavity for relativistic 
particles where the Lorentz factor doesn’t change significantly over a single cell. If however 
the linac is designed with a non-zero synchronous phase, then there will be an RF focusing 
or defocusing term as the forces at the cavity entrance and exit no longer perfectly cancel 
as the beam reaches them at different phases of the RF. If the synchronous phase is chosen 
to be longitudinally stable where early particles receive less acceleration, then the RF is 
radially defocusing, and vice versa. The RF defocusing force, F;., is a function of beam 
radial offset, r, and is given by [20] 


. er (dE, 1 B\ OE: 
Fr) =—>5 (= (5; =) T) (5.218) 


where FE, is the longitudinal electric field applying the acceleration, which is a function of 
radius, longitudinal position and RF phase ®. Integration along the beam path, offset from 
the central axis by a distance r, for a pillbox cavity yields 


erm EoLT sin ® 
Ap(r) = PEA (5.219) 


where À is the RF wavelength, and Eo LT is the cavity voltage as defined in Chapter 3. This 
shows the defocusing increases with increasing RF frequency and that, coupled with the 
large bunch lengths captured for a given synchronous phase, means that lower frequencies 
are preferred for proton linacs of low to intermediate energy. While a similar effect occurs 
in electron linacs below 0.5 MeV, this energy can be reached in a few cells hence higher 
frequencies are typically used. An example of this is the ESS linac which uses 352.2 MHz 
up to 201 MeV before transitioning to 704.4 MHz after 201 MeV, allowing higher gradient 
elliptical cavities to be used without using a very large radius structure. It starts with an 
RFQ, which allows the beam to have longitudinal bunching, radial (electrostatic) focusing 
and acceleration in the same structure, up to 3 MeV before being further accelerated in a 
DTL up to 79 MeV, which allows efficient acceleration for low particle velocities. In order 
to run at high duty cycles at higher gradient, the linac must then become superconduct- 
ing so the beam is then injected into a 352.2 MHz superconducting spoke cavity up to 
201 MeV. Above 201 MeV the beam is sufficiently relativistic to increase the frequency up 
to 704.4 MHz utilising elliptical cavities up to 2.5 GeV. 

As well as the RF defocusing, there is also space-charge defocusing as will be discussed 
in Chapter 7. We can provide external transverse focusing to counteract this using magnets, 
but there is no way of providing an external longitudinal bunching force, hence the syn- 
chronous phase is usually chosen to be longitudinally focusing. Strong transverse focusing is 
required in low- to intermediate-energy proton and ion linacs in order to compensate for the 
space-charge effects and the RF defocusing. This means smaller phase advances per section 
are chosen in the lattice design than at higher beam velocities, typically increasing along 
the length of the linac. 


5.10 Non-Linear Beam Dynamics 


In this chapter we have studied beam dynamics of a single particle. The majority of this 
chapter has been the study of linear motion, and we used matrices to represent our trans- 
formations and we use the matrix M as the map that brings an initial state vector X (so) 
to a final state vector X (s1), so that 


X(s1) =M- X(s0). (5.220) 
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We have also considered non-linear motion when we looked at chromaticity, as this involves 
a combination of x and 6. What does non-linear motion look like in general? 

The extension to non-linear motion using the formalism of matrices is straightfor- 
ward [21] if we use index notation. In this notation, we use indices to label rows and 
columns of our matrices and vectors. Now the linear transformation looks like 


X% = > Mă; 
= MyX;. (5.221) 


In this formalism we make extensive use of the convention that repeated indices are summed 
over (in this case 7). This can be extended to higher-order terms in the following way, 


X; = Mi; Xj + Tik; Xk; (5.222) 


where the additional term T;jķ generally describes the non-linear mapping. For example, 
we can think of Tiss as describing the non-linear coupling between momentum deviation 
and position, with a term looking like 


Xı = Mis X6 + Tie X. (5.223) 


Note that Mie is the linear dispersion and Tise is the first non-linear dispersion. 

The majority of non-linear beam dynamics is beyond the scope of this book. However to 
motivate its use, consider a series of beam line elements consisting of an RF cavity, a drift 
and a four-dipole chicane. The RF cavity can change the beam’s phase space by imparting a 
longitudinally dependent change in momentum, called a chirp. A linear chirp would increase 
or decrease momentum linearly when moving from the front of the bunch to the back. This 
chirp is useful when the beam enters the chicane, which essentially provides a structure 
where the particle’s path length depends on the particle’s momentum. This means we can 
adjust the longitudinal size of the bunch by rotating the longitudinal phase space. A short 
bunch is obtained at the expense of a large momentum spread, and vice versa. This linear 
rotation can be modelled by consideration of the Msg element of the map of the overall 
system, obtained from the composite map (all the matrices multiplied in the correct order) 
of the three elements. Similarly, the non-linear term Tes gives the non-linear distortion 
of the rotated longitudinal phase space through the compression system. In essence, it 
describes the non-linear chirp given to the beam by the non-linear compression system. 

So how do we obtain the non-linear maps of our beam line elements? Well, that is a very 
big question and we refer the reader to the many good books available on the topics [1, 22]. 
Some non-linear maps are straightforward. Consider an RF cavity, which acts to boost or 
reduce the particle’s momentum dependent on the arrival phase. This means the longitudinal 
position is unchanged, and the momentum deviation is changed by the voltage and the sine 
of the phase 


d=6- ae sin(wz/c), (5.224) 
Eo 


where w is the cavity frequency and we have followed the formalism of [1, 22]. This map 
is inherently non-linear, as the sine contains all odd powers of z. A series expansion of the 
right-hand side of this expression would generate the non-linear coefficients defined above. 
For magnetic elements, there are many different ways to obtain the non-linear maps. 
We have already, in the section on the correction of chromaticity, obtained the map for the 
sextupole. At the time we did not use the term non-linear map, but that is what it is. This 
was done by using Newton’s dynamics, and integrating the Lorentz force on the particle 
along the length of a magnet. The field of the sextupole is a quadratically-rising field, 


S 
B, = Szy, By = 5 =a); (5.225) 
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where S defines the sextupole strength, d?B,/dx?, and hence the resulting kick to the 
transverse angle is quadratic in the transverse offset. We saw how this could be used for 
chromaticity correction. A more structured method to obtain non-linear maps is to use 
the Hamiltonian for our particle and Hamilton’s equations to compute the dynamics. The 
resulting map can be expressed as a Taylor series in the dynamical variables (e.g. (x, px)) 
using these methods. More advanced and formal tools such as Lie analysis also provide 
methods to extract the maps. For an excellent discussion see [1, 22]. We also note that 
many good references tabulate the maps for many elements, e.g. [23]. When using these, be 
sure you understand the variables and the approximations used. 


Exercises 


1. Imagine a proton storage ring, with a beam momentum of 20 TeV/c and 17,000 proton 
bunches, with 1 x 10!° protons per bunch. 


(a) What is the stored energy of this machine’s beam? 


(b) If the circumference is 83 km, and the field is 6.6 T, what fraction of the ring is filled 
with dipoles? 


(c) The LHC beam energy is 360 MJ. What problems might this cause? 


2. Show, for forces that are constant in time, that the Hamiltonian is a conserved quantity. 


3. Prove that magnetic fields bend the trajectory of a particle but do not do any work 
on the particle. This means the particle energy does not change. Try doing this two 
different ways. 


4. Magnetic forces are generally transverse to the direction of motion. What does this 
mean about longitudinal control of a beam? 


5. Substitute the Courant-Snyder ansatz into Hill’s equation and derive the differential 
equation obeyed by the $-function. Comment on how this could be solved numerically. 


6. We looked at a FODO cell where the focal lengths of the focusing and defocusing 
quadrupoles were the same. Find the focal length of two opposite-polarity quadrupoles 
of focal length f, separated by a distance d. Then, imagine they were different focal 
lengths; what would this mean for the phase advance in the x and the y plane? Assuming 
thin-lens optics, find expressions for the phase advance in each plane for a focusing 
quadrupole with focal length fı and a defocusing quadrupole with focal length fo. 


7. Describe the impact on a beam if a quadrupole of gradient g=500 T/m is displaced 
vertically by a millimetre. What is the impact of the displacement of a sextupole on the 
same beam? 


8. Obtain the expression for 8p in the defocusing quadrupole of a FODO cell. Now explain 
using equations how to propagate these parameters from the quadrupoles and work out 
the 6-function anywhere in the FODO cell. 


9. For a FODO cell with parameters K = +0.54102 m7?, 1, = 0.5 m and Larit = 2.5 m, 
show the phase advance per cell is 45°. 


10. Returning to the derivation of Hill’s equation, derive the inhomogeneous Hill’s equation 
in the presence of non-zero momentum deviation 6. You may need to consult the broader 
literature. 
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11. Show that the ellipse with a single-particle emittance of €rms corresponds to 68% of the 
particles in the beam. 
12. A 3 GHz RF linac for accelerating protons at 100 MeV has an accelerating gradient of 
50 MV/m. If the linac operates with a synchronous phase of 20°, what is the energy 
acceptance of the linac? 
13. (A little more open ended) Implement our linear transport equations for (a, x’) in your 
favourite computer code. Transport some particles through a FODO cell by choosing 
some sensible initial particle coordinates and cell parameters. Now add calculation and 
evolution of the lattice functions (e.g. $-function). 
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6.1 The Origin of Electromagnetic Radiation 


6.1.1 The Fields around a Moving Charge 


We begin by considering a stationary charge q at r’, which has the associated field (for an 
observer at r, and illustrated in Fig 6.1) of 


B=0. (6.1) 


We can argue that a stationary charge does not radiate in two ways. Firstly, we know from 
our earlier discussion that electromagnetic radiation would have to have a component with 
a magnetic field, but we see for this stationary charge that B = 0 everywhere. Secondly, 
energy flow from a charge radiating should vary as S œ 1/r? to satisfy conservation of 
energy; in other words, E and B must vary as E œx 1/r, B œ 1/r if there is electromagnetic 
radiation emanating from the point charge. We see from the diagram that there is no 
component of either field that has this variation, and hence there is no radiation emitted. 
We next consider a charge moving at a constant velocity with respect to some observer. 
Both the charge and the observer are in inertial frames of reference, and so we may perform 
a Lorentz transformation from one frame to the other and still retain the observable which 
is the total amount of electric flux around the charge. A moving charge generates a magnetic 
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FIGURE 6.1 Field lines around a stationary charge, and a definition of the position of the charge, r’, 
and where its electric field is experienced, r. 


field, but here we consider how the electric field of a moving charge appears to a stationary 
observer. We Lorentz transform the spherically-symmetric electric field and see that in the 
direction of motion, the field lines are compressed by a factor 1/7; the electric field is 

q 1-8? î 


HE 4reo (1 — 8? sin? 0)3/2 |r|? (6.2) 


where the polar angle 0 = 0 along the direction of the charge motion. It can be seen that 
the electric flux through a small area element dA still varies x 1/r? as it does around a 
stationary charge (Fig 6.2). One way to think of this is to remember that field lines from 
a uniformly-moving charge are still straight; hence their separation is proportional to the 
distance from the charge, regardless of the direction of the field lines. The area dA traced 
out by any four field lines varies as dA œ r? regardless of the direction away from the charge. 
Hence the field strength still varies as Æ œ 1/r? in any direction, and so again there is no 
emitted radiation. Another way to look at this is merely to transform to the rest frame of the 
charge, where of course it is stationary and therefore not emitting photons; the ‘fact’ that 
it is not emitting photons should still be true if we observe in a different inertial frame. A 
consequence of this ‘compression’ of the electric fields line around a rapidly-moving charge 
is that a detector that is sensitive to electric fields will see a pulse of electric field as a charge 
passes close by; this is the principle by which many diagnostic instruments work, such as 
beam position monitors (BPMs) where a capacitive pickup senses the voltage generated by 
a passing bunch of particles a few millimetres away. 

Another way to consider a charge with constant velocity is to examine the magnetic field. 
The Biot-Savart law allows us to directly calculate the field created by a moving charge. 
According to this law, the field created by a moving charge is 


Ho qv Xt 
B(r) = ———. ; 
U= mE (6.3) 


Again, we see that the magnetic field in any direction only varies as B œ 1/r?, and hence 
there is no radiation. In summary, a charge moving at constant velocity emits no radiation. 
Therefore, radiation requires acceleration of the charge. 
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FIGURE 6.2 Electric field lines around a stationary charge (left), and around a charge moving at 


constant velocity 8 = v / c (right). A moving charge has field lines compressed into a plane perpendicular 
to the charge’s direction of motion, and the spread in field lines has typical width Y ~ 1 / y where y = 
Etotal / Eo. 


6.1.2 Radiation from an Accelerated Charge 
A Displaced Point Charge 


It is possible to use a simple picture of a displaced point charge in order to derive the power 
and radiation pattern of an accelerated charge; this manner of describing the emission is 
generally attributed either to the physicist Edward Purcell, or perhaps earlier (according 
to Malcolm Longair) to J. J. Thomson. 

We imagine a point charge initially at rest which is then subject to a short period of 
acceleration At, after which time it is moving at a constant velocity u « c. Hence u = aAt 
for an acceleration a. A short time later, we may observe two regions with different field 
configurations (Fig 6.3). Sufficiently close to the charge an observer sees the new location 
and speed of the charge, with field lines emanating radially away from it. Far from the 
charge an observer at a distance r still sees the field at the previous, stationary location; 
a time r/c has not yet elapsed to allow the observer to see the new motion of the charge. 
Between these two regions there must be a boundary where the field lines change from the 
old, stationary situation to the new, moving situation, and this boundary moves away from 
the charge’s location at a speed c. We remember that in free space V - E = 0, so there can 
be no discontinuities in electric field lines (this would imply extra charges at the boundary, 
which isn’t true). The moving boundary must therefore appear as a kink in the electric field 
lines. This kink is the emitted radiation. We now calculate its properties. 

By making the assumption that the final velocity u « c, we may say that the electric field 
lines are approximately parallel inside and outside the kink. We now imagine an observer 
looking at a time t at some angle 0 to the final motion of the charge (Fig 6.4); the charge has 
a location ut which, viewed by the observer at 0, appears to be moving at u] t perpendicular 
to the line of observation and ujt along it. We may then relate the perpendicular component 
of the electric field at the kink, F,, to the radial component Fj, in terms of the perpendicular 
motion and the radial motion. This is 


E _ ut 


SS Ee A 
El cAt te ) 


We may see also that u, = a1 At (the component of the motion perpendicular to the line 
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FIGURE 6.3 Illustration of how a small acceleration in a charge initially at P can generate radiation. 
After a uniform acceleration a for a time At, the charge is then moving uniformly at u = aAt. When 
observed at a later time t there will be two portions of field. Within a radius r = ct an observer sees the 
new, uniformly-moving charge; beyond r = ct an observer sees the old field of the stationary charge at P. 
At the boundary r = ct the field lines must still be continuous; hence there is a (small) kink in the electric 
field. 


of observation), so that we may state 


at ar 
c c 
since r = ct. But we also already know that 
Ej (6.6) 


Anegr?’ 


(the same field as if the charge were stationary), so we therefore obtain an expression for 
E, as 
— qa 
~ Amege2r’ 

We see therefore that the magnitude of the kink electric field E} œ 1/r. However, we 
should also notice that the electric field at time t depends on the motion of the charge at 
an earlier time T = t — r/c depending upon the distance of observation r. 

Since we expect that at large distances there will be a plane wave emitted by the accel- 
erated charge such that B is perpendicular to E, we can from this obtain that 

1 


B=-*xE (6.8) 
C 


E, (6.7) 


and that Bı = E, /c. Alternatively, we can directly obtain a similar formula for B, to that 
for E, using Faraday’s Law. Combining E and B, we see immediately that the Poynting 


vector 1 
S = — (E x B) (6.9) 
Ho 


points radially outwards from the accelerated charge along f, and that its magnitude S œ 
EB x 1/r?. 
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FIGURE 6.4 Illustration of how to obtain the magnitude of the perpendicular component FÆ, of the 
electric field at some angle to the direction of charge acceleration and subsequent motion. The ratio of the 
parallel and perpendicular components of the electric field is just the ratio of the parallel and perpendicular 


components of the velocity. 


Radiation Pattern from a Displaced Point Charge 


We see from the previous section that 
a, =asing, (6.10) 
and hence we may write the magnitude of the electric field E(r, t) at some location r as 


o qļla(t — r/c)| sin 0 


E .11 
pep = A (6.11) 
This is illustrated in Fig 6.5. The Poynting flux (i.e. the power flow) is then 
22/4 — “2 
is(r,t)| = 2 |a*(t — r/c)| sinf 0 (6.12) 


1672 ec? r? 
which has units of Wm~? (see Fig 6.6). Note that S x 1/r? as it should. 


Total Radiated Power 


Since we now know the Poynting flux S(r,t) = S(r,6,¢) at a given distance r and polar 
angle (80, ġ) (noting that S has no dependence on azimuthal angle ¢ — see illustration in 
Fig 6.7), we may now integrate over the polar angle 0 to obtain the total power P(t) as 


P(t) = f " sda (6.13) 


where 
dA = 2nr? sin 0d0 (6.14) 
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FIGURE 6.5 Illustration of how the magnitude of the emitted electric and magnetic fields vary with 


observation angle 0. 


6=0 


FIGURE 6.6 2D illustration of how the magnitude of the Poynting vector S (here shown as the distance 
of the solid from the origin, for any given angle 0) varies with observation angle 0. 


is the area of a slice d at an angle 0 of the overall sphere into which radiation is emitted 
(this is illustrated in Fig 6.8). Explicitly therefore, 


T 22 t— 
P(t) = f (gaT td eGA sin 648) (6.15) 
0 1677 €9c3r? 
" 2a?(t — r/o) 
galt- r/c) [" . 3 
P(t) = ———_. 3 ; al 
(t) ona f sin? 6d6 (6.16) 
We may use a trigonometric identity to obtain the integral of sin? 0 as 
Lae T z 1 4] 2 2 4 
sin? 0d@ = sin 0(1 — cos“ #)d@ = |—cos@+ —cos’@| =-+-=-. (6.17) 
0 0 3 o 3 3 3 


Hence we obtain an expression for the instantaneous total power emitted by an accelerated 


charge: 
_ Par(t—r/c) 


P) 6regoc? 


(6.18) 
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This is Larmor’s formula, and we used Edward Purcell’s method to derive it.* Larmor’s 
formula is the basis of all radiation calculations for charges. 


a 


FIGURE 6.7 3D illustration of how the magnitude of the Poynting vector S varies with observation 
angle 0. 0 = 0 points up. There is no variation of emitted power with azimuthal angle œ. 


6.1.3 The Hertzian Dipole 


We may use the same basic argument that we used to obtain the Larmor formula to consider 
an oscillating current element and the resultant radiation that it emits. We will see that 
consideration of the current is equivalent to considering the motion of charge; the radiation 
pattern obtained is the same — it’s just the method that is different. We will see below that 
the Hertzian dipole is the starting point for understanding the radiation emitted both from 
moving charges and from the radio-frequency sources that provide energy to them; both 
situations typically involve oscillatory motion that gives rise to Hertzian-like emission. 


An Oscillating Current Element 
We consider two locations (1) and (2) aligned along the z axis and separated by a small 
distance l that each have a variable amount of charge on them 
qı = +906 *" (6.19) 
q2 = —qoe ““* (6.20) 


*The reader is encouraged to be very careful here, since a number of textbooks will be encountered that 
use c.g.s. units, in which case the radiation formulae look quite different. We remind the reader that 
everything presented here uses the SI system of units. 
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FIGURE 6.8 Illustration of how to calculate the total radiated power by considering slices of the sphere 
into which radiation is emitted, each slice having an area dA = 27r? sin OdO for a particular polar angle 


0. 


such that the current flowing between the two points is 


d f 
T= a = —iwgoe 2. (6.21) 
If the separation | — 0, there is still a current Io = —iwqo. We recall the formula for the 
magnetic vector potential 
> / t 
act) = | UD ay (6.22) 


7 An vi |r — r'| 


for a current distribution j that exists within a volume V’. In the present case where the 
locations (1) and (2) are equidistant about the origin, we may write the vector potential as 


A(r,t) = T) ——2, (6.23) 


where we regard the product I,/ as staying the same when l > 0. We obtain the magnetic 
field in spherical polar coordinates as 


1 # rô rsin 0 
= — (2) 2] 2] 
B=VxA= Zang| or 38 a6 (6.24) 
A, rAg rsinbAg 
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By looking at the terms in the cross product in sequence, we see that the components of B 
in polar coordinates are 


B, =0, 
Bo = 0, 
Bg =  (Iol)k sin 0 (+ i) a (6.25) 
We recall Ampere’s Law in free space is 
VxB= n (6.26) 


so that we may then obtain the components of the electric field explicitly as 


1 2 i eil(kr—ut) 
Dl) (1+ & | —— 
ad (1E) SA, 


los Ate c r 
1 k i 1 ei(kr—wt) 
Eg = I in 0 H } 
° Areo (Iot) er (a kr i) e 
Ey = 0. (6.27) 


These are quite complicated expressions, so it is just as well that we obtained them in the 
simplest way possible — via the vector potential. We may distinguish between two different 
regimes — near-field and far-field — where the far-field regime is the radiative part. 

The near-field regime is when kr < 1; in other words, the distance of observation 
from the dipole is much less than the wavelength emitted. In this case, the dominant field 
components are 

Lo f ei(kr—wt) 
By ~ —(Jol) sin 6———_ , 
$ T ol) r? 
1 2 ei(kr—wt) 
(Yo kr? 


tow 
4TEo c 
ei(kr—wt) 


1 
Eg ~ — (Jol) sin 6-—.— 
0 (Jol) sin kr3 


-2 
AT €9 (6 8) 


The components of E, and Eg look like the field around an electric dipole, and have a 
magnitude which falls as x 1/r? as one would expect. 

The far-field regime is when kr > 1, in other words the distance of observation from 
the dipole is large compared to the wavelength emitted (this is similar to the Fraunhofer 
distance). Now, the dominant field components are 


i(kr—wt) 
Ba(r,t) ~ iL (Iol)k sin 6 _, 
4T r 
1 k i(kr—wt) 
E,(r,t) ~ —i——(Jol)~ sinf (6.29) 
AT € C r 


(note that k/c = w/c”). As we saw before, r L B L E, and 


[Eo] 


EA (6.30) 
|Bol 


We can see explicitly the direction of the electric and magnetic fields (they point in the 6 
and ¢ directions respectively). We also see that E and B oscillate in phase with each other, 
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and that their magnitudes vary as E œ 1/r and B œ 1/r as they should. Combining the 
two components together, we may then obtain the power emitted by a Hertzian dipole as 


2 
Bis) Ne aie ey, (6.31) 
67rEqc? 


There are several equivalent ways to write this formula, but this particular way shows 
explicitly that P œ I?; this is an important fact. 


Radiation Resistance 


We have seen that for a Hertzian dipole, P œ I*. This implies that there is an effective 
resistance for emitted radiation, which depends upon the length of the dipole l and upon 
the frequency w that the current is oscillating at. We see from the formula for emitted power 
that we can define this radiation resistance as 


Po’ 


Rrad = (6.32) 


6reoc3 ` 


Remembering that w = 2rc/À and that c? = 1/uo€o, we can re-write Raq in a number of 


equivalent ways: 
goa mti ane l sity EX? (6.33) 
come Ee NAJ B cas T an i 


In the last of these expressions we have defined a quantity Zo as 


1 
Zo = boc = — ~ 377 Q (ohm). (6.34) 
€9Cc 


Zo is known as the free-space impedance and, as you can see, it has the correct units. Note 
that Zp is often given in an approximate form 


~ 807°, (6.35) 


so that that the radiation resistance may be variously written as 


ry ag pr 
~~ 2 = ~~ mE N EE 
Zo ~ 807 G) ~ 800 G) ~790(5) (6.36) 


As you can see, these equations allow a rough calculation of the radiation resistance (and 
hence the power emitted) for a given dipole emitter, as long as the length of the dipole and 
the emitted wavelength are known. 

As an example, we set l = /4; this is a so-called quarter-wave antenna, also often called 
a monopole antenna. For example, a VHF antenna where À = 3 m (frequency f = 100 MHz) 
could be made with a length l = 0.75 m. This gives a radiation resistance of Rraa ~ 50 Q 
(which you should recognise as a very common resistance found in electronic equipment). 
For a peak current Jo = 100 A flowing through the antenna, the emitted power would 
be P ~ 500 kW. This power is not unusual — it corresponds roughly to the kind of power 
emitted by the UK Winter Hill transmitter, which provides television signals for Manchester 
and the surrounding country. 
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An Oscillating Dipole 


We derived our equations for a Hertzian dipole by considering current flowing back and 
forth between two points. We may equivalently view this situation as two points (separated 
by a small distance l in comparison to the emitted wavelength A) upon each of which charge 
is deposited or removed; the charge on either end is equal and opposite. The charge on each 
end can be described as 


q = +qosinwt (6.37) 


where + is for one end of the dipole and — is for the other end. The current flowing on/off 
the two ends of the dipole is then just 


dq 
I = — = qow cos wt. 6.38 
dt qo ( ) 
In other words, Io = qow, and the oscillating current is equivalent to an oscillating dipole 
moment. The dipole moment of a given charge separation is just po = qol, so we may write 


p = posin wt = qol sin wt. (6.39) 
In other words, we may write 
Iol = Pow, 
Iolw = pow?. (6.40) 
Remembering that when we take an average over time (sin? sin) = E, we can re-write the 
Larmor formula for the time-averaged power as 
Pw? 
(P) = me (‘current picture’) 
pew! 
(P) = Tree (‘dipole picture’) (6.41) 


The dipole picture is more interesting. It tells us what power will be radiated by a given 
amount of charge qo moving over a distance l. The power radiated varies very, very strongly 
with frequency. P œ wt: if the frequency is doubled, the power radiated goes up by a factor 
of sixteen! We will see below why that is of such importance. 

To summarise: when we talk about a Hertzian dipole, we mean: 


e non-relativistic charge motion; 
e a source size 1 << À where A is the emitted wavelength; 
e an observation distance r > A (i.e. kr > 1); 


e the emitted power P œx Ij and P x wt. 


6.1.4 Antennas 


When talking about the Hertzian dipole, we briefly mentioned the idea of an antenna as 
a current-carrying object where the current oscillates with time. We will now formalise 
this concept by discussing different types of antenna. Antennas are hugely important in 
accelerator science, as they are a basic component used to couple electromagnetic power 
from one oscillating system (say, a waveguide) to another system (say, a cavity). 
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The Half-Wave Antenna 


The half-wave antenna is also known as the dipole antenna, and is the conceptual basis 
of many sources of RF power described elsewhere in this textbook. It consists of a cable 
(usually coaxial) that connects to two aerials, one aerial to each of the cable conductors 
(this is illustrated in Fig 6.9). The aerials are not connected to each other, but see the same 
current from the cable. Since this real antenna has ends, obviously current can’t flow out 
of those ends. Hence the current at the end of the antenna must be zero to be physical. 
Without knowing anything else, we can immediately say that the current in a real antenna 
must be maximal at its centre, and zero at the ends; if the antenna is fed with current at 
its centre, we can first assume that I linearly falls away from its centre. With this simple 
linear picture, we obtain that the effective current Ieff = Ip/2, and the effective antenna 
length leff = 1/2. Hence the radiation resistance (also called the impedance) is 


1 (20 i\? 
Rrad > ~ | — Zo | > 200 2 | = ` 6.42 
i TE 0) (5) (6.42) 


Comparing this equation to our previous equation for a 1/4-wave antenna, we find a radi- 
ation resistance Rraa ~ 12.5 Q, a substantially smaller value. Hence for the same current 
in the drive cable we get more power. For example, a 100 A peak current would drive 
Pra = I Rraa ~ 125 KW. 
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FIGURE 6.9 A first approximation to the current J flowing in a half-wave antenna is that the current 
fed into the middle by an AC source falls linearly towards the ends. The two leads that carry current into 
and out of the antenna generate radiation fields that cancel. 


Let’s now do the derivation more properly. We suspect in reality that a half-wave antenna 
doesn’t really have a current along its length that falls linearly; we expect a standing wave 
to be set up, and the lowest mode of that standing wave is one where the current has the 
form of a half-sinewave. In other words, we assume that J is a maximum at the position the 
antenna is fed (i.e. at its centre), and J = 0 at the ends. The standing wave comes about 
because of the generation of transient voltages; the voltage can be momentarily different at 
different points on the antenna because it takes time for the currents to move from place 
to place. Hence we can describe the current at a different point z along the antenna as 


Laie fee (6.43) 


(i.e. there are two waves moving up and down the antenna such that their currents cancel 
at the antenna ends). This is shown in Fig 6.10. We may therefore describe the real part of 
the current in the antenna as 


2 
I = Io cos Z cos wt (6.44) 
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so that J = 0 at z = £\/4, recalling that A = 21 where l is the (total) length of the antenna. 
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FIGURE 6.10 A better calculation of the electromagnetic radiation emitted by a half-wave antenna. As 
well as the variation of current along the length of the antenna, we must also account for the small phase 
difference in the emitted radiation, as seen by a distant (far-field) observer. 


We may now sum up the contribution to the (far-field, radiative) component Eo of each 
of the current elements passing at any moment through a short section of the antenna dz, 
viewed at some distance r at an observation angle 0. This is 


+1/2 I 1 
Eo(r,t) = J a sin 6 sin(wt — k|r — r'|)dz’. (6.45) 
1/2 4Teorc 


Each component dEg of the electric field seen at r contributes with a different phase such 
that the resulting E and B fields are 


BETS 2i cos(Z cos 0) ei(¥t—kr) 


Amegc o sing r : 
2iuo , cos( 3 cos 0) ei(wt—kr) 
mes i 6.46 
an ) An 0 sind r l ( ) 


noting once more that Eg/Bẹ = c as it should. The total (time-averaged) Poynting flux is 
then just 

I2? cos?(% cos0 

(s) = e Gest) 


= 6.47 
8r2egc r?sin?ð ( ) 


The total power radiated by the antenna may be calculated straightforwardly by integrating 
over all angles 0 and ¢; it’s just long-winded. The integration yields 


27 T 

(P) = I J (S) r? sin 0dôdġ, 
0 0 
Ís 2m 


1 T cos*(Z cos 6 
= = / is f S 
4reoc \ 27 Jo o sin? é 
SS 


=1 
2 T 2T E 
_ ió cos“ (35 cos 0) 
4TEoc Jo sin 0 


dé. (6.48) 


Note that the integration over ¢ cancels with the factor 1/27, and one of the factors sin 0 
cancels. The remaining integral can only be carried out numerically, and 


4 cos? (7 cos 0) 
0 


sin 0 


dO ~ 1.22. (6.49) 
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Hence the average emitted power is 


I2 
(P) ~ 1.22—2— x 0.194Z0I2ns> (6.50) 
4TEoC 
remembering that Zo = 1/eoc and J?,,, = Iĝ/2. The radiation resistance of a half-wave 


antenna is therefore just 
Rraa ~ 73.1 Q. (6.51) 


This impedance is close to that of a ‘standard’ 75 Q coaxial cable. We summarise the 


half-wave antenna power as 
I? I? 
(P) ~ 0.194Z > ~ Ba (6.52) 


For example, to obtain 5 kW transmitted power from a single half-wave antenna, we require 


[2 
By ~ 5000, (6.53) 
and therefore Ig ~ 11.7 A. 


Half-Wave Antenna Radiation Pattern 


We saw earlier that a simple Hertzian dipole generates a radiation pattern with power 
distributed as 
(S) x sin? 0. (6.54) 


A realistic half-wave antenna has a radiation pattern distributed as 


cos?(3 cos 0) 


(S) x (6.55) 


sin? 0 


It’s not immediately obvious how these compare, so let’s plot them. One can see in the 
figure below that the half-wave antenna is more directional than a simple Hertzian dipole. 


0=0 


FIGURE 6.11 Comparison of the radiation pattern from a Hertzian dipole (dashed line) with that of a 
half-wave antenna (solid line). The half-wave antenna has a more directional output. 
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6.2 Radiation from Moving Charges 


We now consider a most important aspect of particle accelerators; the phenomenon that 
the charges moving within them radiate. In this section, we will consider some simple cases 
of that radiation, although of course there are many more complex situations. 


6.2.1 Cyclotron Radiation 


We first consider a non-relativistic charge (in other words, y ~ 1) moving through a uniform 
magnetic field B which is oriented perpendicular to the velocity v of the charge, i.e. v L B; 
we assume to begin with that there is no electric field E = 0. The usual Lorentz force 
F = q (E + v x B) reduces to the simpler form F = quB where F is perpendicular to both 
v and B; it is very important to remind ourselves here that the magnetic field does no work 
upon the charge (and vice versa). No net energy is exchanged between the charge and the 
magnetic field (in this classical picture!). The charge will thus move in a circular path that 
remains at right angles to the field. Note that if there is a component of the motion parallel 
to B then the charge will move in a helical path around the field lines.* 
Equating forces we have 


mv? 


— =qvB 6.56 
, (6.56) 


(where m is the mass of the charge) so therefore the radius of the circular path is just 


mu 


= (6.57) 


p 


The acceleration of the charge is 


quB 
a = — 


= WU, (6.58) 


m 


where we have defined the cyclotron frequency (also known as the Larmor frequency) 


v qB 
we = - = —. 
p m 


(6.59) 


This is of course the angular frequency of the cyclotron motion; the actual frequency (i.e. 
how many times the charge comes round past a fixed point) is just 


oe (6.60) 


The two most commonly used particles in accelerators are electrons and protons. Substitut- 
ing their masses into this formula we obtain the electron cyclotron frequency as ~ 28 GHz/T, 
and the proton cyclotron frequency as ~ 15.3 MHz/'T. In other words, low-energy electrons 
in a magnetic field of 1 T will gyrate in the field at 28 GHz; doubling the field will double 
that frequency. In a standard microwave oven magnetron, the electron gyration frequency 
is 2.45 GHz, and hence the magnetic field the electrons are immersed in must be about 
0.09 T (made by ordinary permanent magnets). 


*In plasma physics this is known as gyration. 
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Cyclotron Radiation: Power and Frequency 
Substituting the acceleration a above into Larmor’s formula directly gives us the power: 


gue? 


= Grace (6.61) 
In contrast to an antenna, the acceleration of the charge in a magnetic field is constant and 
hence there is no factor 1/2 in the average power. The output power x v?, in other words 
P x K where K is the kinetic energy of the charge. P is the total power emitted in all 
directions (i.e. over all angles ¢). Also, note that P is the power emitted by each charge; if 
we have N charges then the power simply adds up.* 

If we observe side-on a charge moving in a magnetic field, it looks at a sufficient distance 
like a Hertzian dipole.* This is of course why we examined the case of the Hertzian dipole 
earlier. We therefore expect the emitted radiation (in the far field) to be just the same: 
the frequency of the emitted radiation is the same as the cyclotron frequency. Also, the 
polarisation of the emitted radiation is parallel to the plane of the circular motion. 

In a real cyclotron there may be many, many protons moving together.* For example, a 
typical modern cyclotron might have protons circulating with a kinetic energy of 10 MeV 
(corresponding to a velocity v = 44x 10° ms~+ in a magnetic field of B = 1 T; y ~ 1 and the 
protons are not relativistic). The cyclotron frequency is we = 96 x 10° s~+ or fe = 15 MHz; 
proton cyclotrons have cyclotron frequencies which are tens of MHz. The power emitted 
per proton is P ~ 10-7? W, or P ~ 10718 W/pC. This is a very small value. What does it 
tell us? It tells us that protons don’t radiate very much, and so they don’t lose a significant 
amount of their energy when circulating in a magnetic field; hence our original assumption 
of circular motion is valid. We will see below that in some circumstances the radiation given 
out by a charge can be significant with respect to its initial kinetic energy. 

In this first derivation of the cyclotron frequency we obtained 


aa nies (6.62) 


by equating the centripetal force to the Lorentz force on the moving charge. However, we 
should remember that at a sufficient velocity the charge will gain mass. We know of course 
that a charge increases in mass according to m = ymo where y = E/Eo, so our derivation 
for the revolution frequency should really be 


2 


=quB (6.63) 
where v = c and m = ymo. Hence 
mpc = qBp, 
Bymoc 
= —_ 64 
ae (6.64) 


*We must be careful about this point; the radiation only adds up if it is emitted by the charges incoher- 
ently, i.e. they act as separate emitters. Look ahead at the discussion of coherent synchrotron radiation 
in Section 7.4 

*«Sufficient’ here means >> r, where r is the cyclotron radius; this means the side-to-side motion looks 
completely sinusoidal. 

*Each bunch might be around 1 pC. 
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so that the revolution frequency fr is 


1 qB 3 
bede aial, (6.65) 
27p 2rnp 2r ymo y 


This means that relativistically-moving charges emit cyclotron radiation at this modified 
frequency rather than at the classically-obtained value. Below, we will see that with a 
sufficiently-large y there are a number of other important differences. 

Whilst the radiation emitted per charge might not be very much and therefore doesn’t 
change the kinetic energy of the charges, if we have a very, very large number of charges then 
the output power can be quite significant. For example, a plasma* immersed in a magnetic 
field can give out a significant amount of radiation; at any given temperature the electrons 
will have much greater typical (thermal) velocity v than the ions, and so only the electrons 
will be significantly radiating. If we have a density Ne electrons per unit volume, then the 
radiation from the plasma can be written as 


Neq?w2v? 


P = 
6reoc? 


(6.66) 
(per unit volume). Re-writing v in terms of the kinetic energy K, we have P ~ 6.2 x 
1072 N. BK Wm? where K is the kinetic energy in eV [1]. 

If the plasma lies within a uniform (i.e. constant) magnetic field, then the emitted 
radiation has a well-defined single frequency equal to the cyclotron frequency fe (for the 
electrons). If the magnetic field is not constant (say, it varies by some small amount across 
the region occupied by the plasma), then there will be other frequencies also emitted which 
are harmonics of fe, i.e. at 2f., 3fe and so on. We can calculate the intensity of these other 
frequency components by calculating the Fourier transform of the magnetic field variation. 
An example is that of the typical domestic/commercial fluorescent lamp*; these contain 
a plasma of ionised mercury vapour caused by an ‘arc’ ignited with a sufficient voltage; 
the voltage causes the gas molecules to break down (ionise), and the free electrons then 
move and cause further ionisations. This is known as a gas discharge lamp; mercury vapour 
discharges give out blue/ultraviolet wavelengths, and a phosphor coating on the inside of 
the glass envelope of the tube converts that into a decent spectrum of white light. A typical 
kinetic energy of the moving charges inside the fluorescent tube might be K ~ 1 eV, and 
the charge density might be Ne ~ 1017 m~. If we take a (switched-on) fluorescent tube and 
place a magnet near it (giving a field in the tube of — say — 0.1 T), then the emitted power 
from the electrons in the plasma is P ~ 6 x 1075 Wm~°. Note that the volume of plasma 
inside a fluorescent tube is a small fraction of 1 më, so the emitted power is quite small; 
but it is quite detectable. Also, in contrast to the visible light from the phosphor (which 
emits frequencies around 1014 Hz), the cyclotron radiation from the plasma electrons has 
an emitted frequency here of fe ~ 2.8 GHz (the cyclotron frequency of electrons is much 
higher than for protons or ions in the same magnetic field). 

We here mention a paradox. Elsewhere in this textbook we frequently encounter current- 
carrying coils which — for example — are used to generate magnetic fields (usually using iron 
poles and yokes). Since there are electrons circulating in those coils we might expect that 
they should radiate, since they obviously must be accelerating inwards in order to go around 
the coils. However, they do not. One way to see that they do not is to remark (in our other 


*A plasma here is defined as a volume of ionised atoms in which the numbers of positive and negative 
charges add up to give a quasi-neutral overall charge. 
*The long ‘tubes’ you often see above your head in lecture theatres and labs. 
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argument for whether radiation occurs) that in a coil of wire carrying a constant current 
I there are no time-varying electric or magnetic fields; hence there should be no radiation. 
How do we resolve this apparent paradox? Clearly, for a sufficiently-smooth distribution of 
charges (that gives a constant current), there must be cancellation of the radiative fields from 
each charge. Conversely, we therefore would expect that a non-uniformity of the electron 
distribution will give rise to net radiation; the frequency spectrum of the emitted radiation 
should be the Fourier transform of the time variation of the electron density. This is what 
we see, but we shall not derive it here. Another phenomenon is that nearby charges can 
give an enhancement of the radiation; this is the phenomenon of coherent radiation, which 
will be discussed in Chapter 7. 


6.2.2 Synchrotron Radiation 


We saw that cyclotron radiation is the electromagnetic radiation emitted by non-relativistic 
charges deflected by moving through a magnetic field — often in a circular path. Synchrotron 
radiation is the equivalent process, but for when the charges are moving relativistically 
(y > 1). In the previous section we saw that relativity modifies the formula for the cyclotron 
frequency; it also greatly changes the pattern and strength of the emitted radiation. 

We saw earlier that a Lorentz transformation of the electric field around a moving charge 
compresses the electric field lines into a ‘pancake’ with characteristic width ~ 1/y transverse 
to the direction of charge motion (as seen by an observer in a different frame of reference). 
A similar Lorentz transformation of the classical Hertzian dipole radiation pattern results 
in the pattern of radiation emitted by a relativistically-moving charge. The difference here 
is that the radiation is compressed into a typical width ~ 1/y in the direction of charge 
motion. Also, the compression is not symmetric: there is much more radiation emitted in 
the forward direction than in the backward direction (see Fig 6.12). It is possible to obtain 
the radiation pattern directly by considering the Liénard-Wiechert potentials [2]. 

The fact that the opening angle of the radiation is compressed to 0 ~ 1/7 by a Lorentz 
transformation has some important consequences for the nature of the radiation emitted. We 
can explain those using some simple arguments. Firstly, we picture the effect of the Lorentz 
transformation on the apparent acceleration experienced by the charge. The (transverse) 
acceleration in the magnetic field is a = d?x/dt? in the charge’s frame; as seen by a (station- 
ary) observer, the apparent distance dz* = da is unchanged by the Lorentz transformation, 
however the apparent time dt = ydt* to give a* = ya. The acceleration appears to the 
charge to be occurring over a longer time, and hence there is more radiation emitted. Hence 
the power emitted is 


2 ( A ie. Dig aa 
pati) _ vary 


= = ‘ 6.67 
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The radiated power is increased by a factor 7*, which can be enormous if y is significant. 

As an example, we consider a proton and electron with the same kinetic energy K = 
250 MeV. The proton has y ~ 1.3 — the cyclotron radiation power is increased by a factor 
1.34 ~ 3 — and the very small radiated power per proton is still very small. In contrast, 
the electron has y ~ 500; the radiated power is increased by a factor 5004 ~ 1011, which 
is huge. The limit of this radiated power upon the ultimate energy achievable by electrons 
was first realised in 1946 by John Blewett [3], and later described by Julian Schwinger in 
1949 [4]. 

The comparison between protons and electrons is hugely technologically significant. As 
we have seen, using electrons means two things: the radiated power is much higher than 
it would be for protons; also the radiation is much more forward-directed, which means it 
is easier to utilise in experiments. Conversely, when colliding particles together in storage 
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y=4 y=8 
FIGURE 6.12 Variation in the radiation pattern of charge orbiting (anticlockwise) in a magnetic field 


as y rises. When y = 1 (non-relativistic motion) the emitted radiation follows the Larmor formula. A 
dramatic reduction in the opening angle is already apparent for moderate values of ¥. 


rings* to do particle physics experiments, protons are advantageous because they emit far 
less electromagnetic radiation; et-e~ colliders have the limitation that each doubling of the 
collision energy gives rise to sizteen times the amount of energy lost to radiation, which 
eventually becomes too costly to replace. Some people think the LEP-2 collider, in which 
the stored electron/positron energies were in excess of 100 GeV, is the largest energy one 
can store electrons — even in a relatively large circumference of 27 km. 


The Spectrum of Synchrotron Radiation 


We saw above that cyclotron radiation is mostly emitted at the same frequency with which 
the charges are oscillating (either back and forth or when orbiting in a field). Synchrotron 
radiation is completely different. We may understand what’s going on by imagining an 
observer viewing an orbiting, relativistically-moving charge; rather than seeing continuous 
cyclotron radiation, the observer only sees synchrotron radiation for a short time per orbit. 
The observed pulse length is shortened because of the 1/y factor of the radiation opening 
angle, and shortened by another factor 1/7 because of Lorentz contraction; Fig 6.13 illus- 
trates this. Hence the typical emitted frequency of the synchrotron radiation is related to 
the cyclotron frequency as 


fs ~ fey’. (6.68) 


*We saw in Chapter 2 that a storage ring is a particle accelerator in which the particles are stored for 
long times by making them orbit repeatedly using dipole magnets. 
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Another way to explain it is to take the emitted cyclotron frequency in the charge’s rest 
frame, and apply a relativistic Doppler shift into the observer’s frame of reference. Hence 
the observed frequency of the synchrotron radiation is 


e Ji 2 
: -Jir pT’ (6.69) 


where we can make the approximation given for the Doppler formula because 6 ~ 1. 


Radiation pulse shortened by factor y 


Back of pulse catches up with 
front of pulse by factor y 


FIGURE 6.13 Illustration of why the frequency spectrum of synchrotron radiation is pulsed with typical 
frequency y? Je 


Both these arguments lead us to conclude that synchrotron radiation has a typical emit- 
ted frequency which is many times higher than the cyclotron frequency. However, whilst 
cyclotron radiation is emitted at a single frequency (i.e. at the cyclotron frequency), syn- 
chrotron radiation is emitted over a wide range of frequencies; this is because of the pulse 
length shortening we just mentioned. 

Synchrotron radiation — even from a single electron — is pulsed because the narrow angle 
of emission has the effect that it is only observed fleetingly. It is this pulsed nature that 
means that it is composed of a wide range of frequency components.* We can quantify this 
by comparing two quantities. Let’s consider a charge of rest mass mo moving relativistically 
in a circle due to a uniform magnetic field B. The pulse period (how long it takes the charge 


to orbit once in the field) is 
1 _ 27ymMo 7 


= F Be (6.70) 
where fe = eB/2mm_-. The pulse duration is 
1 1 
dt= 5 = e- (6.71) 
Hence the pulse duration is much shorter than the period: 
ao, (6.72) 
pi 


So-called synchrotron radiation facilities (or ‘sources’) utilise these various properties 
of the radiation emitted by relativistically-moving charges. Because the overall power from 
electrons far exceeds that from protons, all synchrotron radiation sources utilise electrons.* 


*A shorter time duration means a wider frequency spread because one is the Fourier transform of the 
other. 

* Actually, some facilities have used positrons instead; positrons have the same rest mass of 0.511 MeV/ 2 
as electrons giving the same power output for the same stored energy and beam current. 
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As an example, suppose a synchrotron radiation facility has electrons of K = 1 GeV circu- 
lating in a constant magnetic field of 1 T. Hence y ~ 2000 and fe = 28 GHz. The orbital 
period of the electrons is 71 ns, but the pulse duration is 0.9 attoseconds; the synchrotron 
radiation typical frequency is fs ~ 10!7 Hz. In other words, photons of typical energy 
E = hf, ~ 460 eV are emitted; these are so-called soft X-rays. This sets a scale for syn- 
chrotron radiation sources; to make X-rays in typical dipole fields of 1 T, we need to use 
electrons with energies ~ 1 GeV or more. 

Our stored electrons above emit light pulses that are basically periodic 6-functions with 
period t,. Taking the Fourier transform of this, we can see that the frequency spectrum of 
the synchrotron radiation extends up to ~ f;, with frequency components spaced at 1/t,, 
in other words, spaced apart in frequency by fe/y; this is shown schematically in Fig 6.14. 
fs >> fc/y, so the frequency spectrum of synchrotron radiation is basically continuous up 
to fs. 

In summary: relativistically-moving charges emit light which appears pulsed in time to 
an observer. The pulsed nature of the light means that it must be composed of many different 
frequencies from zero up to the typical frequency fs. Hence, we can see that synchrotron 
radiation is composed of photons from zero energy up to €s ~ hfs. However, each emitted 
photon is still polarised in the same direction as the electron motion; hence, synchrotron 
radiation observed in the plane of the orbit is linearly polarised, whilst when observed out 
of the orbit plane (at some angle w, say) the radiation will be elliptically polarised. 


Single light pulse 7 
(a) A A 
; A 
Fourier 
ooo o 
í fs p 
(b) A A 
g A 
Fourier 
WM o 
t ; fe p fs w 
Periodic light pulse Spacing F <fs 


FIGURE 6.14 (a) A single light pulse of duration dt gives a frequency spectrum which is continuous 
up to a frequency fs = 1/dt. (b) A train of light pulses each of duration dt and separated by a period 
T=y7 / fc gives a frequency spectrum that still extends up to fs, but is now composed of a set of discrete 
lines separated in frequency by 1/T = fe/y = fs/ g3. An observer of the radiation from a relativistic 
electron moving in a circle will see periodic pulses of light of this nature; 7 may be very large indeed since 
typical values of y encountered are ~ 10°. 


Another way to write fs is as follows. We recall once more that 


B 
f= PiP (6.73) 
TMo 


But we know that the radius of curvature of a charge moving in a magnetic field B is just 


Bymoc 
Sa. 6.74 
p n (6.74) 
We see that 
aB _ Bye 


, 6.75 
My (6.75) 
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so that we can express fs as 


_ cB 


p= ; 6.76 
Í 3ng (6.76) 
Hence the typical emitted photon energy is 
heBy? 
s = h b= : š 
€ f Jap (6.77) 


Using our example synchrotron radiation source, our 1 GeV electron moving in a 1 T 
magnetic field has a bending radius of p = 3.336 m. We again obtain fs ~ 10!” Hz, and a 
typical emitted photon energy of €s = hfs ~ 460 eV. Note that for very relativistic electrons 
(y > 1), a very useful formula relating the electron energy E = ymec? to the magnetic field 
and bending radius is 

E [GeV] ~ 0.3Bp [Tm], (6.78) 


where the units to use are given in square brackets (some books will give the slightly 
more accurate formula E [GeV] ~ 0.2998Bp [Tm]) [5]. This formula doesn’t work at all for 
protons, of course. 


Critical Photon Energy and the Emitted Photon Number 


In the preceding discussion we have calculated the typical photon frequency fs, where 


cpa? 
Tam Dy (6.79) 
A fuller calculation can be done than the one we have done here, where a critical frequency 
can be defined such that, half the radiation power is emitted in photons above the critical 
frequency, and half the radiation power is emitted in photons below the critical frequency 
(this is shown later in Section 6.2.3). Hence the critical frequency is also known as the half 
power point. Since the energy of photons below the critical frequency is obviously lower 
than that of the energy of photons above the critical frequency, and also the frequency can 
extend all the way down to zero, synchrotron radiation is composed of very, very many 
low-frequency photons and rather fewer high-energy photons. The derivation for the critical 
frequency gives 
3 c73 
227p 
(since we always use high-energy electrons, we have here set 8 = 1 so that there is a 
corresponding critical energy of 


ferit = (6.80) 


O 3 hey? 


an 81 
€ p (6.81) 


We can write the critical energy in convenient units as €e [keV] ~ 2.218E3/p or €e [keV] ~ 
0.665E? B, where E is given in GeV and B, p are in SI units. Alternatively we may calculate 
the corresponding critical wavelength [6], which is Aeri [Å] ~ 18.64/E?B. 

Since there are many, many more low-energy photons than high-energy photons, the 
average photon energy is lower than the critical energy. The mean energy of the photons 
(see below) is 


(e) = 5 (6.82) 


It is worth remarking here about the effect of the quantised nature of the photon emission; 
an electron experiences a small recoil that lowers the emitted photon energy. Schwinger in 
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1954 calculated the approximate effect on the overall emitted power [7]; the corrected power 
(including the quantum effect) is 


55 Ec 
Peep fen a ee 6.83 
: ( eae) ee) 


where P is the power calculated without that correction. For a 3 GeV electron orbiting in 
a 1.4 T magnetic field (critical energy €e = 8.3 keV), the correction is around 5.5 x 107°, 
which is small enough to ignore entirely. 

It’s instructive to consider how many photons are emitted per orbit of the charge. In a 
uniform field B, a charge emits electromagnetic radiation with average power 


B g a294 


P (6.84) 


6regc3’ 


The charge orbits in a circle of radius p = ymoc/eB, so that the acceleration a = v?/p 
where v = c. Hence the power P may be written as 


pa q? B+cty4 
6rEqc? p? 
2 ,R4A4 
q“cp*y 


The radiation energy Uo emitted during one orbit (which takes t, to happen) is 


244.4 
2 
sa ee 
6reoc’p? Be 
223.4 
g p? 
= l 6.86 
P (6.86) 


Uo is known as the energy loss per turn. 

So far we have not considered a specific particle type. However, in nearly all practical 
cases we are dealing with electrons that have a large kinetic energy (say, 10 MeV or higher 
— usually much higher). Hence q = e, mp = me, and 6 = 1 to a very good approximation. 


Therefore we have 
O 8/3 hey? e244 


(e) = T z = Io (6.87) 
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We can then estimate the number of photons emitted per orbit as 


Uo 45 2 p e744 
N, = = . 6.88 
7 (€) 833 hey? 3e0p Pee) 


We note here that the fine-structure constant, a, can be written as 


4ra = ——. (6.89) 


We can therefore write N, more simply as 


_ on 
V3 


This is a very pleasing formula, since it contains only dimensionless constants. Note that 
N, is independent of the circumference, and therefore independent of the bending field B; 


Ny ay ~ 0.06627. (6.90) 
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a larger bending field leads to a greater rate of photon production but a smaller orbit time. 
The overall rate of photon production is therefore just 


Ne, (6.91) 


A typical storage ring will have y ~ 10° and 7, ~ 107° s, so that each electron emits ~ 108 
photons per second. 


Calculating Synchrotron Radiation Output 


We will now carry out some example calculations of photon output for the Diamond storage 
ring in Oxfordshire (UK); this is a typical storage ring source used to generate X-rays for 
a variety of scientific experiments.* The electrons in Diamond are maintained at a kinetic 
energy K = 3 GeV, and pass through dipole magnets that give a field of 1.4 T, which 
corresponds to a bending radius p = 7.1 m; note that the circumference L of the storage 
ring is not L = 27 p, since not all of the path taken by the electrons has a bending field 
B applied.* In Diamond, the circumference L = 561.6 m, so that the revolution period is 
Tp = L/c ~ 1.87 us. Hence the critical energy of the photons is ee = 8.3 keV and critical 
wavelength Acrit = 1.48 A, and the mean photon energy is (e) = 2.6 keV. 

Of course, there isn’t just one electron orbiting in Diamond. Knowing that an ammeter 
placed at any point in the storage ring measures a typical passing current of 300 mA and 
that obviously I = AQ/At, the total charge in the storage ring AQ is 


Mx Te” (6.92) 
C 


(see Fig 6.15) where the circumference is L = 561.6 m, and At = 7,. The number of electrons 
is then just Ne = AQ/e ~ 3.5 x 101? (the stored charge AQ ~ 560 nC) for a current of 
300 mA. 


FIGURE 6.15 Relating the current I (observed at some point along the circumference L) for a total 
number of electrons Ne. 


*There are over fifty such third-generation facilities in the world today. 

*In fact, in most storage rings only a small fraction of the particle path has dipole field; in Diamond 
only about 8% of the circumference is dipole magnets. The word ‘circumference’ when used for storage 
rings is therefore a bit of a misnomer; by ‘circumference’ we mean the total distance travelled by the 
particle in one orbital period. 
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By comparing the synchrotron radiation power to the revolution period, we can straight- 
forwardly obtain that the energy loss per turn is 
2.4 
Uy = L ~ 1.0 MeV. (6.93) 
3€0P 


The total power radiated by each electron is P, = 86 nW, but since there are ~ 10!” 
electrons, the total power emitted is Potali = NeP. ~ 300 kW. This is a simply enormous 
power. Synchrotron radiation facilities such as Diamond are the only known method of 
producing such a large quantity of X-ray photons; they are one of the brightest artificial 
sources of photons. The number of photons emitted by each electron as it executes a single 
orbit is N, = 5ray/ V3 ~ 380 photons, or ~ 2x108 photons per second. Hence the radiation 
may be treated as ‘quasi-continuous’. 

You might be wondering what the effect is of splitting up the dipoles into pieces, rather 
than having a continuous bending B field as we had in our original derivation for Ny. The 
way to understand it is that photons are only being emitted when the electrons are passing 
through the dipole magnets; they’re not being emitted when there is no B field accelerating 
them.* Hence, N, is just the same regardless of whether there are ‘gaps’ in the B field. 
try/fe is still calculated the same way, but now it is not the same as Tp, the time it takes 
for the electrons to go around the storage ring; 7, is only needed to convert beam current 
into a number of electrons. 

In our example of the Diamond storage ring we note that the radiation from Ne electrons 
has a power P x Ne; this radiation is incoherent synchrotron radiation (ISR). For a beam 
current J, the total emitted power is 


4 
ey ly 
Pro al = ; 6.94 
total Sean (6.94) 
this may be expressed in practical units as 
E [GeV]*f, [A 
Protal [kW] = gg. 4 [GeV] B [A] (6.95) 


p [m] 


Another way to express the emitted power is simply as P,otaı [KW] = Uo [keV]; [A]. 


6.2.3 The Spectrum of Emitted Synchrotron Radiation 


In this section we consider only the emission of radiation by ultra-relativistic electrons, 
since these are the only particles used practically in synchrotron radiation sources. As we 
saw above, an electron circulating in a uniform magnetic field B has an effective angular 
frequency wo that may be written as 


S 


7 (6.96) 


wo 

where p is the bending radius of the electron. An (angular) critical frequency may also be 
defined as 

We = 3cy? /2p. (6.97) 


* Actually, in real storage rings there are extra ‘insertion’ devices (see later in this chapter) that can 
produce additional photons, but this fact doesn’t change the basic argument about N4. 
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It can be shown that [8] the horizontal and vertical electric field components of the far-field 
radiation at a given frequency w — as seen by an observer looking at the electron at some 
angle w to the plane of the electron orbit — are 


Elo) = apg (2) (0+7?) Kapal@), 
Bylo) = ian (E) +P)? Kal). (6.98) 


R is the distance of the observer from the electron, K/3(G) and K1,3(G) are the standard 
modified Bessel functions, and 


om (=) (1+ yyy? (6.99) 


In the far field, the radiation seen by the observer must be an electric field E perpendicular 
to the observation direction n, so that the Poynting vector is 


S = egcE*n. (6.100) 


If the observer sees the radiation over a solid angle AQ, the total energy passing through 
this area (R?AQ) in some time At is 


W = (n- S)AtR?AQ = egcE? R?AtAQ. (6.101) 
We can relate this to the total power emitted P(t) over time as 


AT 
P(t) = 7 = coc E(t)? R7dQ. (6.102) 


Writing E(t) in terms of its Fourier transform 


1 a ; 
E(t) = —— E(wje tdw 6.103 
O=- Bw) (6.103) 
we can obtain the energy passing through a solid angle as 
dw ee 
—— = 2cR? i |E(w)|?dw. (6.104) 


We may thus define the spectral angular distribution (i.e. into a bandwidth dw around a 
given frequency w) as 
aw 
dQdw 
Since the electron executes c/27p revolutions per second in the magnetic field B, we may 
write the spectral power density as 


= 2epcR?|E(w)|’. (6.105) 


d? P c d2w R2 > 
dQdw  2rpdQdw T : (6.106) 


Since |E(w)|? = E7(w)+E7(w), we may finally obtain the spectral power density of bending 
magnet (dipole) radiation as 


d?P se*q? w\? 2,2)? | g2 PY? 2 
T (2) (1+74’) [Kao + ag Kis) ; (6.107) 
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Integrating over all angles gives the spectral power 
dP d? P d? P 
— = | ——d0=2 ———d 
dw J ‘io / apd” 
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where we have now written the total power as 


ce2y4 


and the relative spectral powers of the horizontal and vertical polarised components S$, and 
Sy (and their sum S) are 


w 93w a w 
Sy (=) — EE | j K5/3(u)du + K2;/3 (2) ; 


We 167w, 
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S (2) ee Ks5/3(u)du. (6.110) 
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The function S(w/we) is universal (common to any we) and is shown in Fig 6.16 [8, 6, 5]. 
Integrating over all frequencies we obtain 


f (E) aye. =1. (6.111) 


We see that, of all the power radiated, exactly 7/8 is horizontally-polarised whilst 1/8 is 
vertically-polarised, in other words P,/P, = 7. This is in contrast to cyclotron radiation, 
where P,/Py = 3. Integrating frequencies up to w = we only, we see 


J, ° a A (uf) = 5. (6.112) 


This demonstrates what we said earlier, which is that half the total radiation power is 
emitted at frequencies below the critical frequency we. Finally, we may similarly obtain how 
the emitted power varies with observation angle ~ as 


2,2 
a ma h i (6.113) 


dy 32.1 +7242)" Ty?) 
Photon Flux 


We have so far derived the emitted synchrotron radiation power as a function of emission 
frequency. It is straightforward to re-express this in terms of the number of photons. Each 
emitted photon has an energy € = hw/2m = hw, so the critical energy of emission is €e = hwe. 
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If the number of photons emitted per second with energy «€ is N(e), then the power emitted 
at that energy is just eN (€). We may use this to determine the number of photons emitted 
into a bandwidth de/e as 


dN _ «d?N 
dQde/e — dQde 
O ËP @&P 
~ dde  AdNdw 
362472 w \? E E PY? 2 
= =, — | — 1 K: — K ; .114 
32n4hegp (=) ( ay Y’) | 2/3(G) + (1+ 7202) 1/3(G) (6 ) 
Using the definition of the fine-structure constant 
e? 1 
= aS 6.115 
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we may obtain 
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(1+7) | K3/5(G) + YO" _ KG) (6.116) 
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This quantity dN /dQ is the spectral intensity. If we have Ne electrons, we may re-cast this 
expression in terms of the beam current I, = N-ec/27p to obtain 


dN 3ay7? Ih (ôe wo \7 Oe. 
= 1 
dQ 4r? e (e We ( PEY ) 
2 Py? 2 
This expression can be given for the on-axis (q = 0) emission from a beam current J, (in 
Amperes) of electrons of energy E (given in GeV) as 


AN] 433 x 10'3n27, (“ 2 aa (6.118) 
dQ p=0 . We 2/3 Bi), ` 
which has been given in the usual units of photons per second per milliradian squared per 
0.1% bandwidth. From this we can obtain the total rate of photon emission into a given 
energy bandwidth de/e = dw/w as 


7 I, / ôe € a 
N = V¥3ay— {| — ] (— K5/3(u)du. (6.119) 
e\e €c) Ju=ejec 

This quantity — known as the spectral photon flux — is consistent with the earlier expression 
for photons emitted in an orbit, given in terms of the typical photon energy, Ny = 5ray/ v3. 
We may write this as 


N = 2.46 x 10ER (<) 1 Ks/s(u)du. (6.120) 
Ec u=e/€o 

Here, N has been given in units of photons per second, per horizontal milliradian, per 

0.1% bandwidth. This is a very useful expression because it follows the universal curve 

€/€¢ Wiese K5/3(u)du. The spectral flux peaks at around €/e, ~ 0.25, and falls off sharply 

above ¢/e. ~ 5; when €/e. = 10 the emitted power has fallen by about 3400x. 
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FIGURE 6.16 The universal curve S (w / We) for synchrotron radiation. 


Synchrotron Radiation Sources 


We have so far discussed dipole radiation by relativistic electrons — also known as bending 
magnet radiation — which arises naturally in any particle accelerator where those electrons 
have sufficient kinetic energy. One important application of this is in so-called synchrotron 
radiation sources, where this radiation is deliberately generated at specific wavelengths 
for use in a wide variety of scientific research. For example, X-ray diffraction experiments 
utilise photons whose wavelength is comparable to the inter-atomic separation in solids — 
i.e. about 1 A; a crystal formed from many atoms has a regular, periodic arrangement of its 
constituent atoms that will give rise to interference between the radiation scattered from 
each atom — a diffraction pattern is formed that can be used to elucidate the arrangement of 
the atoms. Whether a simple crystal such as NaCl or one made of more complex molecules 
such as proteins, the formation of clear diffraction spots depends both upon the spread 
of wavelengths incident upon the crystal and upon how parallel the X-rays are. A good 
X-ray beam brightness depends upon a small electron beam source size at the location of 
synchrotron radiation emission, and the natural broad spectrum of synchrotron emissions 
must have suitable wavelengths selected from it using a monochromator. To quickly form a 
distinct diffraction pattern requires the highest possible X-ray intensity, and is limited by 
the ability of the monochromator and other X-ray optics to handle the heat load; typical 
limits are several hundred W/mm?. 

Synchrotron radiation sources are dedicated facilities — usually large (e.g. covering areas 
exceeding 100 m x 100 m) — providing a variety of experimental beamlines with radiation 
tailored differently depending upon the use [9, 6, 10]. The predominant sources in use to- 
day are so-called third-generation sources, which are electron storage rings within which 
electrons of several GeV in energy circulate for many hours at a time; third-generation 
sources, which are designed by definition to incorporate insertion devices (see below), have 
largely supplanted the earlier first-generation sources that parasitically used electron syn- 
chrotrons (SURF, Tantalus-I, NINA), and second-generation sources that relied mostly on 
dipole radiation (such as the Daresbury SRS, and NSLS in the United States). 

One simple way to vary the output wavelength in a storage ring is to use a wavelength 
shifter, which is essentially a high-field dipole inserted amongst the other dipoles which are 
needed to form the overall storage ring. Hence a wavelength shifter is a type of insertion 
device, whose magnetic field can be turned on or off without significantly affecting the 
operation of the rest of the storage ring. The first wavelength shifters attempted to extend 
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the output radiation emission to shorter wavelengths, and hence a higher field was required. 
Often, superconducting magnets are used to achieve fields as high as 6 T or more; an 
example is shown in Fig 6.17. 
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FIGURE 6.17 An example of a high-field wavelength shifter, as used at the 2 GeV Daresbury Syn- 
chrotron Radiation Source in the 1990s and up to 2008. A central 6 T field created by superconducting 
coils is used to generate a larger critical photon energy than was obtained using the main 1.2 T dipoles. 
Two lower-field ancillary poles lie either side of the main pole to create a localised orbit ‘bump’ within the 
device so that it can be turned on and off without changing the overall geometry of the storage ring [11]; 
some small beam-optical corrections are however still required. It is hence known as an insertion device. 
(Diagram adapted from original © STFC.) 


6.2.4 Wiggler Radiation 


An extension to the idea of the wavelength shifter is the multipole wiggler; a multipole 
wiggler comprises an alternating field arranged along an electron’s path, provided by poles 
of alternating polarity [8]. An example of a multipole wiggler is shown in Fig 6.18. Assuming 
to begin with that there is only a vertical magnetic field B, (as in an ordinary dipole 
magnet), an alternating magnetic field may be approximately described as sinusoidal with 
a spatial period A, equal to the distance between neighbouring north poles. Thus 


2 
B,(s) = —Bosin (=) (6.121) 
where Bo is the peak field in the wiggler. The resultant acceleration on the electron is 
& = d*x/ds? = eB,/ymoc and only in the horizontal plane. The electron deflection angle i 
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is 


K 2 
(s) = — cos (=) (6.122) 
where the so-called K-parameter is 
Boe Xu 
K = 2E% = 0.9336 Bodu (6.123) 
moc 2T 


and where in the right-hand expression, Bo is expressed in T and A, in centimetres. K is 
dimensionless since « = dx/ds. Integrating, we obtain the path through the wiggler as 


es e in (=) (6.124) 


The usefulness of K is that the maximum angular deflection is K/y. Since the opening 
angle of the emitted radiation is ~ 1/7, then if K < 1 the radiation from each pair of 
poles overlaps — giving rise to interference of the radiation — whilst if K >> 1 then there is 
little overlap and the radiation from each pole pair is effectively independent. The condition 
K > 1 defines a multipole wiggler, and K < 1 defines an undulator; otherwise, they are 
much the same. Obviously, undulators typically utilise magnetic fields lower than those in 
wigglers — say, less than ~1 T. In practice there is a regime between K ~ 1 and kK ~ 5 
where there is some interference between the poles. 


aaa 
aie 


FIGURE 6.18 An example of a multipole wiggler, here generating an on-axis field with a maximum 2.4 T 
using a hybrid arrangement of permanent-magnet pieces and steel poles. Half-poles at each end compensate 
the overall orbit shift, and an adjustable gap allows variation of the field (and hence the output photon 
energy). (Photograph ©) STFC, diagram adapted from original ©) STFC.) 
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Wigglers emit radiation with a critical energy 


_ 3hey? 


Ec = 


12 
a (6.125) 


that depends on the instantaneous bending radius p. As with dipole radiation we may state 
this in practical units as 
€e [eV] = 665.025 E[GeV]?B [T]. (6.126) 


Since the wiggler is an insertion device, it can be turned on and off, but more importantly its 
field may be adjusted more or less at will between those values.* Electromagnetic wigglers 
(EMWs) may adjust their field simply by varying the current that drives the field through 
the wiggler poles; permanent-magnet wigglers (PMWs) can vary their field by varying the 
gap between the poles (with some limitations). 

As discussed previously in Chapter 4, PMWs typically use poles made from either SmCo 
(remanent field 0.9-1.1 T) or NdFeB (remanent field 1.1-1.4 T), either in a pure permanent 
magnet (PPM) arrangement using only permanent magnetic material [12, 9, 8], or with 
the addition of steel pole pieces in a hybrid arrangement to augment the on-axis field (see 
for example Fig 4.23 in Section 4.5). In a PPM configuration the maximum on-axis field 
attainable is around 

By = 1.72B,e-79/~, (6.127) 


where g is the gap between the poles (i.e. the available height for beam and vacuum vessel); 
in practice g/Au < 0.1 is a realistic limit, so that PPM wigglers are limited to fields no more 
than around 1.5 T. A hybrid wiggler might augment the on-axis field by perhaps 30%. 
Multipole wigglers (MPWs) are different from ordinary dipoles in that the effective 
critical energy seen depends upon the horizontal observation angle with respect to the axis 
of the wiggler. Viewed by an observer looking along the wiggler axis, the maximum critical 
energy (denoted eco) is 
cco [eV] = 665.025 E[GeV]*B [T]. (6.128) 


Radiation into other observation angles 0 is determined by the critical energy when the 
electron is pointing at 0; this varies with s and is 


218 


peer poe 12 
€ eosin (2) (6.129) 


u 


Knowing that cos(2rs/àu) = 0y/K, we find the critical energy at angle 0 is 


| 2 | 2 
Ec = Eco 1- (2) = ta (a J (6.130) 


Multipole Wiggler Flux and Tuning 


The previous discussion allows us to summarise the emission advantages of a multipole 
wiggler over a dipole magnet: 


e The (on-axis) critical energy may be conveniently adjusted, independently of the electron 
energy and of other devices in the synchrotron radiation facility. 


* Usually some small but manageable adjustments are made to the beam focusing when the wiggler field 
is changed. 
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e The critical energy varies with observation angle, which allows some additional tuning. 


e The total wiggler flux is increased by a factor Nu, where N, is the number of wiggler 
poles. 


To give a sense of the advantages, a typical EMW might comprise 50 pole pairs (NV, = 50), 
each with a peak magnetic field of perhaps 1.6 T if normal conducting, or 4 to 5 T if 
superconducting, which is variable down to zero. The total power emitted from any insertion 
device with a sinusoidally-varying field may be simply obtained as 
1 2.22407 2 p2 

Pota = gremec y K `~ 632.8E° Bo Ll, (6.131) 
where E is the electron energy in GeV. Similar to dipole radiation, radiation from a multipole 
wiggler is linearly-polarised when viewed in the plane of the electron oscillations. 


6.2.5 Undulators 


An undulator is defined as a multipole device where the output is dominated by interference 
effects [13]. We already know that when K < 1 there is significant overlap of the emitted 
radiation from each pole pair; we therefore expect interference will occur at certain wave- 
lengths, enhancing the output intensity; this idea was first proposed by Vitaly Ginzberg in 
1947 [14, 4] and verified experimentally by Hans Motz in 1953 [15]. To determine the wave- 
lengths for which constructive interference occurs, we again assume the electron motion is 
sinusoidal; the combination of its finite electron velocity 8 < 1 and its periodic transverse 
velocity causes the electron to fall behind the photons it has emitted. The average velocity 
in the forward direction is 


3) ~1 f 6.132 
(Bs) = B Gee (6.132) 
The condition for interference to occur is that each electron should fall behind its emit- 
ted wavefront by a whole number of wavelengths per period of undulator passed through. 
Observed at a horizontal angle 0, the condition for constructive interference at emitted 


wavelength A is 
Xu 
nA = ——~ — Ay, cos 0. 6.133 
(B) ee 


Substituting the value for (8,) and for small angles of 6, we can hence obtain the so-called 
undulator equation 


ru K? 54 
ioe (1+ 5 +e). (6.134) 
Each value of n is known as the harmonic number of the emission (not to be confused 
with the storing ring harmonic number h, which is the number of circulating bunches). For 
example, a 3 GeV electron passing through an undulator with a period of 50 mm and K = 1 
emits on-axis photons (0 = 0) with a wavelength of 1.1 nm, or an energy of 1.1 keV. The most 
important thing to note about the undulator equation is the 7? factor between the undulator 
wavelength and the emission wavelength; this arises because of Lorentz contraction acting 
to shrink the period of the undulator as observed by the electron and a Doppler shift of 
the electron emission into the observation (laboratory) frame. y is typically a few thousand 
and undulators have a typical period A„ of a few centimetres, so we immediately see that 
emission wavelengths will typically be ~ 107° m; again, this is useful for typical X-ray 
experiments [8]. Also, note that as an undulator gap is closed, K increases which makes the 
output wavelength A longer. 
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Since interference is occurring from all the N, poles of the undulator, the emitted radi- 
ation will be confined within a certain bandwidth 


AX 1 


For example, a 100-period undulator will emit photons in the first harmonic with a wave- 
length spread of about 1%, which is around ten times larger than the typical spread of the 
electron energies in a storage ring (see later); the bandwidth is determined by the undulator 
and not by the electron energy spread. The opening angle of the radiation is limited also 


by interference effects to 
| 2X 
A0 ~ 4| ——. : 
0 ER (6.136) 


For example, a 50 mm-period undulator with 100 periods has A0 = 40 urad, which is less 
than the radiation opening angle 1/y ~ 170 urad at 3 GeV. It should also be noted that 
the on-axis radiation contains only the odd harmonics n = 1,3,5,7... . 

One of the key advantages of undulators is that they greatly enhance the radiation 
output at desired wavelengths whilst suppressing it at unwanted wavelengths. For a given 
photon flux at an experimental sample this means that much less unwanted X-ray power 
is dissipated on the monochromators.* The angular flux density of the emission is given in 
practical units as 

dN 


an = 1.74 x 10 N2 E?” RE AK), (6.137) 
=ü 
where TIEA 
n K 2 
F, (K) = To Cede E (6.138) 
and r 
nK 
4 (1+ K2/2) (6.139) 


When K is very small (<0.5), F (K) is only significantly greater than zero in the first 
harmonic; in other cases undulators can be utilised routinely up to harmonic number 15 or 
so. The angular flux density dN /dQ x N?, so for example an undulator with N, = 100 
periods gives a photon flux density in the first harmonic which is nearly N? = 10,000 times 
larger than from a simple dipole magnet. It is possible to show that the photon output in 
the fundamental harmonic from each electron passing through a magnet period is 

i= Tak? (6.140) 
from which the total photon output can be readily estimated. 

We have in the above discussion only considered ordinary undulators that deflect the 
electrons in a single plane; in this case the emitted photons are still linearly polarised when 
observed in the plane of electron oscillation, as they are from dipoles and multipole wigglers. 
There also exist a wide variety of more complicated magnetic arrangements in which the 
electrons may execute both horizontal and vertical motion, and devices may be constructed 
to give radiation with both tuneable wavelength and polarisation. 


*A monochromator is a device used to select one X-ray wavelength from a broadband source, and is 
usually made from a large single silicon crystal in conjunction with collimation 
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6.3 Scattering of Electromagnetic Radiation 


6.3.1 The Scattering Cross Section 


An electromagnetic wave passing over atoms causes the charges in those atoms to accelerate. 
Hence those charges radiate; this idea is shown schematically in Fig 6.19. The process of 
absorption of electromagnetic energy by atoms and then re-radiation of that energy is 
scattering. 


Scattered 
NLSIVIVS > NLSIVS > 5 
Incident Transmitted 


FIGURE 6.19 Scattering may be explained in a classical description by considering that incident radia- 
tion provides an electric field that accelerates the charges in a medium, removing energy from the incident 
radiation field. The accelerated charges emit radiation in many directions; the fraction in the original di- 
rection may be considered — along with the part of the incident radiation field that was not absorbed — 
the ‘transmitted’ radiation field. The radiation emitted in other directions is the ‘scattered’ radiation field. 
Note that in this classical picture there is not such a thing as an individual photon which is both incident 
and then scattered; rather, the incident photon is absorbed and then re-emitted in the radiated field. 


Atoms contain bound electrons, which will move to a position z due to the force imparted 
by a passing electromagnetic wave (the nuclei are more massive and effectively stay still). 
The displaced electrons give rise to an oscillating dipole moment in the atom of 


2 
e i 
mie Eo cos wt 


p(t) = —ez = E 


i 6.141 
— w?) + iwy ( ) 
Note that this expression holds for a single resonant frequency wo of the electrons, but 
we can extend this analysis for multiple frequencies if we wish. This oscillating dipole will 
radiate quasi-isotropically, i.e. like a Hertzian dipole, with total emitted power 
4 2 
wtlp(T)| 
P(t) = ———— 6.142 
O= (6.142) 
where T = t — r/c is the usual retarded time for an observer at a distance r from the moved 
electron. We can immediately combine those equations to obtain the average power emitted 
from each atom as 
et Eu 


~ 12megm2c3 (we — w?)? + wg? 


(P 


A question arises: can we relate the incident power to the radiated power? To do this, we 
use the idea of the cross section, which is the effective area (in this case) of the scattering 
objects. The cross section for an atom can be defined as 


(6.143) 


P 


Hy (6.144) 


go = 
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i.e. it is the ratio of the emitted power to the incident power. This is the total cross section, 
in other words it describes the rate at which the incident power is converted to radiated 
(scattered) power for any direction of the radiated power. We can subdivide this total cross 
section into the rate emitted at different angles 0 and ¢; this is a type of differential cross 
section. 

The cross section ø is an effective area presented by the atom (really, by the electrons 
in the atom) to the incoming radiation. Thinking of the incoming radiation as being made 
of discrete photons, some of those photons will strike the area g and those photons will 
be scattered; other photons that do not strike this effective area will not be scattered. o 
thus describes the proportion of photons that are scattered. We can understand how a cross 
section operates by counting the number of photons in the incident and scattered radiation 
as 


P=o,S.. (6.145) 
We Ne 


#/s m? #/m?s 
Since we can write the Poynting vector in terms of the energy density as 


1 
(S) = ce (U) = 5605, (6.146) 
we can re-write the scattered power (collecting separately the constants, frequency depen- 
dence and Poynting vector) as 


P et Ew 
HIS 12megm2c3 (we — w?)? + wg? 
_ et wt 1 Ee 
 6reZm2ct (w2 — w2)? + w29? gaa 
et wt 


= Grete Gee a (6.147) 


Hence, the total scattering cross section is 


et wt 


6rem?ct (we — w?)? + wg? 


(6.148) 


We can re-write this more simply by defining a constant (that you have probably seen 
before): we define the classical electron radius — which obviously has dimensions of length 


— as 
2 
l ~ 2.818 x 107 m. (6.149) 


Te = —— > 
A4regmc 


Substituting into our expression for g, we obtain 
8rr2 wt 


; .1 
3 (we — w?) + wg? (6130) 


We note that o has the correct dimensions (m?) for a cross section. This is a general form 
for the scattering cross section of atoms. 

Let’s look at some special cases of the scattering cross section. The first is for low- 
frequency photons for which w < wo. Hence this situation describes the scattering of light 
of wavelength much longer than the wavelengths at which absorption will be taking place 
in those atoms. We obtain a scattering cross section of 
oe 8rr2 wt 1 


S oie peat 151 
OR 3 wh >? (6.151) 
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where A is the wavelength of the incident/scattered radiation. We use the subscript R since 
this cross section (and the phenomenon that goes with it) is known as Rayleigh scattering. 
We see that shorter wavelengths are scattered much more than longer wavelengths. Consider 
the scattering of visible photons in air, which is an example of Rayleigh scattering. The 
relative rate of scattering of e.g. red and blue photons is given by 


( Aved ) ~ (= mm)" = 16. (6.152) 
Ablue 390 nm 

Despite the two wavelengths being comparatively close, the rate of scattering is dramatically 
different. This is the explanation for why the daytime sky is blue, and why sunsets are red. 
It is very important to note here: in the Rayleigh scattering process we have described, there 
is not net transfer of energy from the light to the electrons. This is an elastic scattering 
process, and we see two important facts: the energy in the scattered radiation is equal to that 
lost in the incident radiation; the scattered wavelength is equal to the incident wavelength. 

Near the resonant frequency we have w ~ wo. We obtain 


piles (6.153) 


This cross section describes so-called resonant scattering, and the cross section for this is 
large because y is typically small. Most interesting however is the high-frequency case where 
w >> wo > y. We now obtain a very simple form for the cross section, which is 


Brr? 
3 


This cross section may also be written equivalently in terms of the other fundamental 
constants as 


or = (6.154) 


et 


oT = 6.74 x 107°° m? = 0.0674 barns. (6.155) 


 6me2m2c4 
The barn unit is convenient for scattering calculations; 1 barn= 10~?°m?. There is no 
frequency dependence at all in this expression; the likelihood of scattering does not depend 
on the incident photon frequency as long as it’s high enough. The cross section is given 
the subscript T because this regime is known as Thomson scattering; the behaviour of the 
scattering cross section with incident frequency is shown for atoms in Fig 6.20. We recall 
that since w is very large, incident radiation does not see that the electrons are bound, so 
that we have free electrons; therefore, as well as describing the scattering of high-frequency 
radiation by atoms, this cross section also describes the scattering of radiation from free 
electrons. It is therefore important when considering the mutual passage of photons over 
electrons in certain laser-plasma interactions. We saw above that (S') = c(U), so therefore 
the power emitted by an electron due to Thomson scattering is 


P=orcU). (6.156) 


The emitted power is due to the electric field energy passing over the electrons at speed c. 


6.3.2 Synchrotron Radiation and the Field Energy 


One reason for deriving the scattering cross section of photons from electrons is to look once 
more at the synchrotron radiation emitted power. We saw earlier that the power emitted 
by an electron moving in a magnetic field with radius p is 


e2 cbt’ 


P= x 
67r€9 p? 


(6.157) 
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FIGURE 6.20 Different regimes for scattering. At low frequencies (large wavelengths) there is Rayleigh 
scattering whose cross section varies as wt. At high frequencies the cross section tends to the Thomson 
cross section OT. In between there is a resonant region of width Y. 


Let’s write the synchrotron radiation power in terms of the Thomson cross section, which 

is 

mec? B44 
e2p2 

We can then write the emitted power in terms of the magnetic field B by remembering that 

p = Bymc/eB, so that 


P=or (6.158) 


P = o7 B’ By’. (6.159) 


We saw for Thomson scattering that P = orc(U). Let’s write the synchrotron radiation 
power also in terms of a field energy density — this time, the energy density of the magnetic 
field B. The energy density in the magnetic field is Ug = B? /2u0, which gives an expression 
for the synchrotron radiation power as 


P = 2orUgeoo B77’. (6.160) 
We recognise that couo = 1/c?, so that finally 
P = 2o07cUp py’. (6.161) 


What does this expression mean? We can regard the electron as taking energy from the 
magnetic field at some rate or, where the magnetic field has an effective Poynting flux 
Spg = cUg; the difference is the extra factor y? from the motion of the electron. 


6.3.3 Thomson and Compton Scattering 


In our scattering derivation above, we calculated the rate of scattering for high-frequency 
radiation; this was the Thomson scattering cross section. This is an elastic process in which 
the incident and scattered wavelengths are the same. However, we also know that individual 
photons carry momentum, and therefore should transfer some of that if they interact with 
an electron; this is the process that we call Compton scattering. Clearly, there must be some 
way of reconciling these two phenomena; we realise that Thomson scattering applies as long 
as the energy of the photon is much less than the rest energy of an electron, in other words 
hf <m,c?. At higher frequencies the momentum transfer starts to become important and 
we have Compton scattering. In ordinary Compton scattering, a high-energy photon (with 
energy ¢;) is incident upon a stationary electron; the photon is scattered by an angle 8 
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causing a recoil of the electron. The scattered photon therefore has a lower energy ef and 
a longer wavelength Ap, given by the standard Compton formula 


Af — ài = Ac (1 — cos 8) (6.162) 


where 


h 
Ne = ~ 0.002 nm (6.163) 


MeC 
is the Compton wavelength; this process is shown schematically in Fig 6.21. We may measure 
the electron mass by measuring the energy change of photons at a specific scattering angle 
G8, and for practical values of this angle we have the requirement that A; ~ Ac; in other 
words, the energy €; of the incident gamma ray photon should be comparable to the rest 
energy Mec? of an electron. 


hf e 
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hff 
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FIGURE 6.21 In the quantum picture of scattering, an incoming photon of frequency f; is scattered 
through an angle 8 into a scattered photon of frequency fr. In the case of Thomson scattering, where 
hfi K Me Cc, there is no appreciable transfer of momentum from the photon to the electron, and therefore 
ff = fi; this is an elastic scattering process. When there is an appreciable transfer of momentum, fr < fi 


and we call it Compton scattering. 


It can be shown that the total Compton cross section tends to the Thomson cross section 
for low-frequency photons. Defining the so-called recoil parameter as 


= (6.164) 


a quantum electrodynamics analysis [16, 17] yields the following exact expression for the 
Compton cross section: 


3 4 8 1 8 1 
rc = OT Ty (2 x x) log(1 + X) 4 5 X XP] (6.165) 
The scattering of long-wavelength light from electrons implies X « 1, for which oe œ 
or(1 — X); at long enough wavelengths (X — 0), the Compton cross section tends to the 
Thomson value as it should. A significant recoil parameter can be obtained if gamma rays 
are incident upon a stationary electron; for example, the gamma rays from the decay of 
cobalt-60 have energies e; ~ 1 MeV, which yields a recoil parameter of X ~ 7.82. In this 
case the Compton cross section is oe ~ 0.2207, a substantial reduction. At these larger 
values of X the Compton cross section tends to 


1 
oc Yor (= how x + 3) ; (6.166) 


which is accurate to about 10% when X > 2. 
We should also compare the difference in the angular distributions of the Thomson- 
and Compton-scattered radiation fields. Thomson scattering has an intensity distribution 
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like a Hertzian dipole (for incoming photons that are polarised), i.e. I œ cos? 8 (where 3 
is the angle of observation compared to the direction of incident radiation); the scattered 
radiation has the same wavelength as the incident wavelength, and is polarised in the same 
direction. The proportion of scattered radiation does not change with incident wavelength. 
In contrast, the Compton scattering intensity distribution is peaked in the forward direction 
(this is compared to the Thomson rate in Fig 6.22), and scattered photons have wavelengths 
that are generally larger because of the momentum transfer; the wavelength varies with angle 
to conserve momentum. As the wavelength reduces, so does the rate of scattering. 


FIGURE 6.22 Rate of Thomson (classical) scattering and Compton scattering (for 1 MeV photons) as a 
function of scattering angle 8, obtained from the Klein-Nishina formula. The Compton-scattered photons are 
forward-peaked due to the conservation of momentum. The dotted ‘peanut’ shape for Thomson scattering 
differs from the Larmor formula; the Larmor formula is the scattering rate for polarised photons, whilst the 


Thomson scattering rate shown here is the rate for unpolarised photons. 


6.3.4 Inverse Compton Scattering (ICS) 


We have seen that in ordinary Compton scattering — where the electron is initially stationary 
— that the scattered photon always reduces in energy. Inverse Compton scattering is the 
situation where the electron is moving sufficiently fast that a collision may cause the photon 
to increase in energy. For this to occur, the electron typically must be moving relativistically 
with y > 1. We will see that an incident photon can be scattered to a much larger outgoing 
energy. 


Energy Change from the Inverse Compton Process 


We consider an electron moving with velocity v where y > 1, and an electromagnetic wave 
is incident upon it at some angle 0 to the direction of the electron. 0 = 0 corresponds to 
the photons being incident head-on with the electron. 

Since the electron is moving relativistically, we must perform a Lorentz transformation 
of each photon frequency f in the laboratory frame to the frequency f’ in the electron’s rest 
frame. For a head-on approach of the electron and the photon, we may write the ordinary 
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relativistic Doppler formula for the frequency change of the photon as 


Eo __ 1+8 _ (1 + 8)? 
70t pow a+ RB) nn 
1+ 8 


For a photon approaching the electron at an angle 0 to the head-on direction, it can be 
shown that the frequency change is given by 


L = y(1 + 8 cos 8) (6.169) 


or equivalently that the photon energy changes as 
e = ey(1 + Bcos@) (6.170) 


Also, the apparent angle of incidence 6’ of the photon upon the electron (in the electron’s 
frame of reference) is related to the laboratory-frame angle of incidence by 


. fo 3 
sin’ = wl + B cone)’ (6.171) 
j cos 6+ 8 
= 172 
cos 0 (LF Boost) (6.172) 


We see that when @ is small and £ is large (i.e. the electron velocity v œ c), the apparent 
change in frequency is 
f ~ 2yf. (6.173) 


As an example, we consider an electron with kinetic energy T = 1000 MeV, so that 
y ~ 1957 and 6 = 1 to a very good approximation. Visible photons of wavelength 500 nm 
are incident upon the electron, so that e; = hc/A = 2.48 eV. The Doppler shift into the 
electron’s rest frame changes the photon energy to e; = ye; = 9707 eV. Hence we see that if 
the electron energy is less than ~ 1 GeV and the incident photons are near-visible (e; ~eV), 
in the rest frame of the electrons e; = hf’ < mec? is still small in comparison to the electron 
rest energy. There is therefore no significant transfer of momentum to the electron, and we 
have ordinary — essentially isotropic - Thomson scattering where the scattered energy (in 
the electron rest frame) is e} = €;. When we transform ¢; back into the laboratory frame 
another Doppler shift is performed. For the head-on case (0 = 0) we see that the outgoing 
photon energy is 
ep = (14 8) ei ~ 4 ei. (6.174) 


This is a very important result. A typically-used incident laser wavelength is 1064 nm (near- 
infrared, corresponding to e; ~ 1 eV); Compton scattering from 50 MeV electrons means that 
the scattered photon energy is around 10 keV (suitable for X-ray scattering experiments), 
from 500 MeV electrons we obtain ~ 1 MeV photons (suitable for exciting resonances in 
atomic nuclei), and 5000 MeV electrons deliver ~ 1 GeV photons! In other words, ‘ordinary’ 
electron energies (up to ~GeV) can be used to generate Compton-scattered photons with 
energies extending far above those available from other sources. Moreover, the generated 
photon energies are tuneable, and this is mostly achieved by varying the energy of the 
electrons rather than by varying the energy of the photons [18]. 


262 The Science and Technology of Particle Accelerators 


A more careful analysis of the scattering process yields the better formula 


47 ¢; 
FT (70)? + Aye (mec?) 


(6.175) 


where @ is the observation angle; the second term in the denominator therefore tells us that 
the produced photons are monochromatic to within some bandwidth as long as the angular 
spread of photons seen by the observer is restricted — we may collimate the scattered photons 
to select a desired bandwidth of photon energies. Above energies of around 100 keV there 
is no alternative source of near-monochromatic photons, and therefore inverse Compton 
scattering is an important method. The third term describes the degree of electron recoil, 
which as earlier, results in a reduction in the scattered photon energy; for 1 eV incident 
photons, this recoil parameter is small even for large electron energies of ~ 1 GeV, but can 
be significant if the incident photons have keV or higher energies [19]. 


Inverse Compton Scattering Cross Section and Output Power 


We recall that the scattered radiation power in Thomson scattering for an incident power 
(S) is just 
P' = or (S') =orc(U') (6.176) 


where (U’) is the average energy density in the incident electromagnetic wave (in the elec- 
tron’s frame of reference). The instantaneous radiated/scattered power from the Thomson 
scattering is quasi-isotropic in the electron’s rest frame with the usual Hertzian dipole pat- 
tern 

@|a?(t — r/c)| cos? ¢ 
7 16r2eoc’r2 : 
where €’ is the angle of the emitted radiation from the electron with respect to the inci- 
dent photon direction in the electron rest frame. In the laboratory (observation) frame the 
angular and energy distribution of the scattered photons is therefore effectively completely 
determined by the relativistic Doppler transformation, known as a kinematic restriction. 

We note that the Thomson-scattered power is defined as a rate of energy emission, and 
hence the total scattered power is invariant under a Lorentz transformation. Hence the 
total emitted power P’ in the electron rest frame is the same as the emitted power P in 
the observer’s frame. To calculate P we need only to calculate P’ and therefore to calculate 
the energy density U’ of the photons in the rest frame of the electron. To do this, we first 
note that photons of a given frequency f and volumetric number density n have an energy 
density 


|S'(r, t)| 


(6.177) 


U=nhf (6.178) 
so that the incident flux is 
S=Uc=nhfe. (6.179) 


The interval in the arrival time of these photons at the electron is reduced in the electron’s 
rest frame by the Doppler shift, and so in the head-on case the effective number density of 
the photons increases to 

n' = ng(1 + 8). (6.180) 
We earlier showed that the Doppler shift increases the photons’ apparent frequency to 


f' = fy(1 + 8), so that 
U' = UP (1+ 8}. (6.181) 


The Thomson-scattered power in the rest frame of the electron is now just 


P! = ofc!’ = op "(1 + 8)’, (6.182) 
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which is the same as the power in the laboratory frame, i.e. P = P’; this is the total power 
contained in the scattered photons, which now have higher energies than they did before. 
However, those photons already had an initial power 


Pinitial = orcU (6.183) 
before they interacted. So, the net power given to those interacting photons is 


Pics = P' — Pinitiat = or Uy? (1 + p}? — orcU 
= orch? + 8)? = 1] 


= ory ja + B)? -— z 
y 
= orc y"[(1 + 8)? — (1 — 8°)] 


= 2orcU y’ p(B +1). (6.184) 


Inverse Compton Scattering Flux 


We know that the energy density U in the (incoming) photon beam is given by the photon 
density, which in most practical situations has a Gaussian transverse profile of some size 
oy. Assuming the photons are scattered directly backwards, it is possible to show that the 
number of Compton-scattered photons is 


Ne N; 
Ny = or, 6.185 
f z 2n(o2 + 0?) ( ) 
where Ne, Nz are the numbers of electrons and photons, respectively, focused into circular 
spots of r.m.s. size ce and ay. We see the same basic scaling with photon number Nz. 
We may alternatively express the laser power in terms of its normalised vector potential 


eA 
= —> Al 
a Ge (6.186) 
where the associated field strength parameter is 
vidi = 
ao = 0.855 x 107° VTA; (6.187) 
2TMeC 


Eo; is the maximum strength of the incident laser electric field, and ag has been expressed in 
convenient units in which the incident laser intensity T is given in W/cm? and the wavelength 
A; is given in um. One may obtain the number of Compton-scattered photons as 


2 
N; = gra nage, (6.188) 


where N, is the number of wavelengths in the incident laser pulse; again we see the scaling 
with number of incident photons as expected. This expression is only valid for the linear 
regime where ap < 1. Multi-photon scattering occurs as the intensity approaches ag ~ 1, 
such that higher-energy scattered photons can be obtained. A number of methods and codes 
are available to estimate the ICS flux, and a good summary is given by Krafft and Priebe [18] 
that includes a number of useful approximations. 
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6.4 Radiation Damping 


In general, radiation damping is the phenomenon of the reduction of some charge’s oscilla- 
tion amplitude due to the emission of radiation. For example, in the classical description of 
an electron orbiting an atomic nucleus, the (classical) emission of radiation by the charge 
would cause it to spiral inwards into the nucleus in some time t ~ 1078 s. In particle accel- 
erators, the term is used to describe the effect of what is basically a quantum phenomenon; 
it arises particularly in the context of electron storage rings, an important case we describe 
here. 

An orbiting electron in a storage ring emits photons continuously with the spectrum 
derived in Section 6.2.3; the photon emission is quantised, which means the energy change of 
the electron is discrete (rather than smoothly changing). At the moment of photon emission 
the electron’s energy changes by a finite amount and the electron experiences a (small) recoil; 
more importantly, the electron has a new, lower, energy and — if the dispersion function 7 
in either plane of motion x or y is non-zero — the electron will now start to oscillate with 
respect to a new closed orbit. This is the phenomenon of quantum excitation; obviously, to 
limit the amount of excitation a good storage ring design should limit the typical values 
of 7. Storage rings are usually planar (i.e. bending only in the x direction) so that the 
photon emission is mainly in the x plane, and also 7, is essentially zero; quantum excitation 
essentially only occurs in a storage ring in the x plane and gives an effective horizontal 
momentum. 7, is limited by making use of achromats (double bend achromats, triple bend 
achromats, etc.) to limit the size of 7, that is generated in the dipole magnets, and hence 
limit the excitation. 

Radiation damping competes with this excitation process; in a storage ring we are im- 
plicitly saying that energy loss from radiation is replaced by re-acceleration of the electrons 
by means of RF cavities; the re-acceleration is only in the beam direction, so any prior trans- 
verse momentum will be steadily damped. For example, in the Diamond storage ring, each 
electron loses about 1 MeV per turn; the actual voltage supplied is somewhat larger, firstly 
because the quantised emission gives rise to a typical energy spread and also because scat- 
tering (mainly Touschek scattering) gives rise to electron energy changes of ~ 1% that must 
be tolerated. Without re-acceleration, the electron lifetime would be r ~ E/Uotr ~ 5 ms; 
with re-acceleration the typical time for electrons to damp to some equilibrium value is 
the same — this is the idea of the damping time. Quantum excitation competes with radi- 
ation damping to give an equilibrium oscillation amplitude — this is just the equilibrium 
emittance; in the absence of quantised emission, the equilibrium emittance would be zero. 


6.4.1 The Radiation Integrals 


Matthew Sands derived a useful formalism for describing the effect of quantised radiation 
not only for the equilibrium emittance but also for other associated electron beam parame- 
ters [20]. The quantities he obtained are known as the radiation integrals, and Sands derived 
five expressions but today many use a sixth [5]; the original integrals also assumed horizon- 
tal bending only. The complete set of radiation integrals (now allowing for vertical bending 
as well) are 
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Zen{m~~] = ce F k'ny)” ds, 


Tey[m*] = f Em — kn}? ds, (6.189) 


(6.190) 


where the integrals are each evaluated over a single turn of the storage ring; k is the 
quadrupole strength, k’ the skew quadrupole strength, and 1/p° = 1/p;, + 1/p;. We see 
that the Z5, and Tsy integrals — which describe the quantum excitation — are dependent on 


the functions H,,(s) and H,(s), which are determined by the Twiss functions and dispersion 
as 


He = Betty + Aae Ny + YaNg 
Hy = Byng + 2aynyny + Yyy: (6.191) 


These are fairly involved general expressions, but they simplify considerably in the (very 
typical) case where there is only horizontal bending and no focusing in the bending magnets. 
Then Liz = $(nx/p3)ds, Lay = 0, Tsy = 0, and Tes = Tey = 0. With this formalism, the 
synchrotron radiation power (per electron) may be expressed as 


E*T, 

P= 6.192 
Y Ont, ( ) 

where the so-called quantum constant is 

55A 
Cy = — = — 7x 3.8319 x 107 m. (6.193) 

32\/3m-c 
Hence the energy loss per turn 
EI  2r.E*I, 

= = $ wl 4 
ey 2T 3m3c6 (6:194) 


There are three damping times — one for each direction of electron motion z, y, s with respect 
to the moving electron bunch centre — which are 


3mec? Lpr 


BA ee 6.195 
2nreJIz,y,sE ( ) 


Trgi = 
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where L is the storage ring circumference. Jy, Jy, Js are the damping partition numbers 
obtained as 


Lax 
Jn =1- = 
Jn = 2+ Veet ty (6.196) 
Ty 
leading to the Robinson Sum Rule 
Je + Jy + J, = 4. (6.197) 


In our planar ring situation we have Jy = 1, and usually Z4; < Zz so that J; ~ 1 so 
that longitudinal oscillations (of the energy) have twice the damping time of the lateral 
amplitudes.* For example, in the 96 m circumference Daresbury SRS at its injection energy 
of 600 MeV and with a dipole bending radius of 5.56 m (B = 0.36 T), the damping time 
Ts = 93 ms; after ramping of the dipole field to 1.2 T to circulate 2 GeV electrons, the 
damping time reduces to 2.5 ms. Another way to state the damping times when J, œ 1 is 


3tr Tx 3tr 
Ta S Tea eS SS i 
OOY nh 2 rT 


(6.198) 


The electrons emit photons at a rate N = 5ray/ /3t, as derived earlier. The RMS 
photon energy (e?) = 11e?/27, so that in a planar ring the rate of photon production is 


N (e) = D y m aR (6.199) 
24/3 [oz] 

The induced energy spread also therefore scales œ y”, and over some distance L (such as 
the ring circumference) is 


2 _ 55a?" fe 1 


Ao ds. 6.200 
B= asva h Teal sia 
This in turn gives an emittance growth 
Srey? fe 1 
ea SE i; sas. (6.201) 
24,/3mec 0 A He 


6.4.2 Equilibrium Properties 


The above expressions can be used in any electron accelerator to determine the emittance 
and energy spread growth, whether it’s a ring or not. In a storage ring, however, an equi- 
librium is formed when the excitation rate equals the damping rate, such that the energy 
spread becomes 

T3 


= C,7?7 7 
Y Tz + Taz 


oO 
a (6.202) 


*To show that Z4, < T2, consider a typical storage ring where $ ds = L ~ 100 m, nz ~ 0.1 m and 
Px ~ 10 m. 
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(assuming here again that Z4, = 0). Hence the relative energy spread in a planar ring is 


OF Cy 
— x ,/——. 6.203 
F F (6.203) 


Note that this is the energy spread of one electron, i.e. the typical range of energies of that 
electron over time. We saw that in a typical storage ring, there are ~ 10! electrons, each of 
which independently has an energy spread og/F so that the whole beam has that energy 
spread. The corresponding bunch length depends upon the momentum compaction factor 
a; that couples energy to longitudinal position (Equation 5.159 in Section 5.7.3). Thus we 
have 

_ elned oz 


s , 6.204 
m= T (6.204) 


where the phase-slip factor ne = &e — 1/7? and ws is the synchrotron frequency; in electron 
rings Ne ~ Qe since y is typically a few thousand. The synchrotron frequency — the rate at 
which electrons oscillate back and forth within the electron bunch — is 


ach cos(¢s)eVrr 


Uo 
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Ws = Wp sin(¢@,) = a (6.205) 
where @¢, is called the synchronous phase and q = eV;/Uo is the overvoltage; wr = 27/t,. 
is the electron angular revolution frequency. The overvoltage gives an energy acceptance — 


here called the RF acceptance — of 


oi 2Uo 5 eyo! 
ERF = TE | q? — 1 — cos (<)|. (6.206) 


erp is typically several percent to accommodate Touschek scattering between the electrons 
in a bunch (see Chapter 7). Using the radiation integrals, we have simply that 


7 (6.207) 


Qe 
We find in many electron storage rings that œe ~ 1074, although it can take different values 
(including negative ones) depending upon the average value of 7, in the dipole magnets. The 
equilibrium emittance is obtained when there is a balance between the quantum excitation 
rate and the emittance damping, such that de, /dt = —2¢,/7,. From this we can obtain the 
natural emittance value (in a planar ring) as 


(6.208) 


showing the contribution of the Z5, excitation and Zz terms. We label this cezo the ‘natural’ 
emittance, and in the absence of field errors we have eyo = 0 (there is actually a lower limit 
on the vertical emittance due to the photon emission, which is rather small [5]). Typical 
values of the natural emittance in modern synchrotron light sources are from 0.1 nm-rad 
to 10 nm-rad, for electron energies around 1 to 8 GeV. A central activity in the design 
of new electron storage rings is to generate as small a value of Zs, as possible; it has 
proven advantageous to split the bending dipoles into as many pieces as possible to allow 
quadrupoles to be interleaved to minimise the average dispersion (nz) that drives Tss, 
creating what are known as multi-bend achromats (MBAs). The consequence is the strong 
non-linear limitation to the dynamic aperture brought about by having so much strong 
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focusing to correct the dispersion; a number of beam-optical cancellation schemes have 
been proposed, primarily based upon setting an appropriate phase advance (for example 
u = T) between the non-linear kicks. 

In real storage rings, small vertical B fields from dipole and quadrupole misalignments 
weakly couple the horizontal and vertical planes of motion; hence the vertical emittance is 
not zero. The horizontal and vertical emittances may be written as €z + €y = €z0; for a weak 
coupling factor & we can write approximately 


Ex ext, €y © K€x0- (6.209) 


Today, position alignments of 10s of um and roll accuracies of 10s of urad allow « ~ 1073 
to give very small vertical emittances in the picometre regime. Unintended vertical bends 
give small residual vertical dispersion of typically a few millimetres, which also contributes 
to ‘effective’ coupling, and so instead, the term emittance ratio is used interchangeably to 
describe €y/€,. 

It is common also in electron storage ring design to separate the emittance contributions 
from the dipoles and insertion devices, in particular the multipole wigglers (MPWs) where 
both Tsx and Tə can potentially be significant. Strong MPWs (i.e. many poles and high field) 
are used to maximise Zz and hence minimise the emittance; this damping wiggler technique 
may be used either in storage rings (where an equilibrium is obtained) or in damping rings 
where the MPWs help an initially-injected large emittance to be damped to a small desired 
value as quickly as possible. Separating out the contributions of the wigglers (labelled ‘w‘) 
from the rest of the ring (labelled ‘0’) we can simply state 

V Dyn + Isr 
Ew = a ia (6.210) 
where €w indicates that this is the emittance with MPWs. The benefit of the MPWs can be 


written as 
Ew 1+ TE, | To 


= 6.211 
We may re-state this ratio in terms of the MPW properties as 
4Cy (Bx) .2 Po 93 
Ew = : a 15T Je NP cope Vr Pu bw (6.212) 
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N, is the total number of MPW poles, (8+) the average horizontal beta function in the 
MPW, pw the minimum MPW bend radius at the peak field Bu, and 04 = Aw/27pPw is the 
peak deflection angle in the MPW, where Aw is the MPW period. The MPW bend angle 
must be limited to avoid too great a ‘self-dispersion’; there is a maximum field for which 
the MPW does not reduce the emittance and for which €,,/€29 > 1. Damping wigglers are 
ideally long, and with lower field. 


6.5 Bremsstrahlung Radiation 


A quite different practical phenomenon from synchrotron radiation, but one that is ulti- 
mately derived from the same basic physics, is bremsstrahlung. The word bremsstrahlung is 
German, and was derived from the words ‘bremsen’ (to brake) and ‘strahlung’ (radiation). 
Bremsstrahlung is therefore ‘braking radiation’. Bremsstrahlung is the name given to the 
phenomenon whereby a charged particle is caused to radiate (and therefore lose kinetic 
energy and slow down) due to it passing close to an atomic nucleus, and so feeling a very 
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strong electric field. A strong electric field at right angles to the particle’s motion causes 
much the same thing as a strong magnetic field: electromagnetic radiation is emitted [1]. 

A common example of bremsstrahlung is in radiotherapy, in which a patient’s cancer 
is treated with X-rays. These X-rays are generated using bremsstrahlung: electrons from a 
suitable accelerator are directed into a metal target (usually something like tungsten that 
has a large atomic number Z, or some other refractory metal"); this is shown schematically 
in Fig 6.23. The electrons may have an initial kinetic energy of, say, 10 MeV, and when 
some of those electrons pass close to one of the atomic nuclei, they experience a strong 
(transverse) force from the electric field of that nucleus. This causes the electron to radiate 
photons. Obviously, the largest photon energy that is emitted cannot be greater than the 
initial kinetic energy of the electron; most of the time, the electrons pass somewhat further 
away from the nuclei and emit lower-energy (‘softer’) photons. Overall, a broad spectrum 
of radiation is emitted, with a larger number of softer photon energies. 

A high Z obviously gives a stronger nuclear field, and a high density p means there 
are more nuclei per unit volume to hit.* As an electron passes by a nucleus, its distance 
from the nucleus obviously changes, and hence so does the electric field seen by the electron 
(Fig 6.24). The closest distance is called the impact parameter, which we label b. The varying 
electric field seen by the electron gives a varying acceleration, although the overall effect 
is that the electron is deflected by some angle between its initial velocity v and its final 
velocity v’. The power emitted by the electron as a function of time is just the same as with 
any other acceleration: 


P(t) = maa’). (6.213) 
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FIGURE 6.23 Radiotherapy is a use of bremsstrahlung radiation; electrons generate X-rays as they are 
accelerated by the nuclei of the metal atoms in the target. A broad spectrum of X-ray emission is observed 
— a large number of low-energy photons and a small number of high-energy photons; the maximum energy 
of the emitted photons is very nearly the initial kinetic energy of the electrons. 


The Electron-Ion Collision 


Whilst the electron does not strictly collide with the atomic nucleus, we nevertheless still 
call it an electron-ion collision (the ion being the positive nucleus bit). We may calculate 
features of the output spectrum of the emitted photons as follows: 


*The so-called refractory metals are those metals with extremely high melting points; these are tungsten 
(W), tantalum (Ta), rhenium (Re), molybdenum (Mo) and niobium (Nb). They are also physically quite 
robust. 

* Although you should take account that high-Z atoms also have a larger mass number A; a good exercise 
is to compare the atomic number density of some common metals like aluminium, tungsten, and lead. 
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FIGURE 6.24 Definition of the impact parameter b for an interaction of an electron with an ion. 


e For a large enough impact factor b (which is basically always true), we can make the 
statement that the nuclear electric field E L v; therefore the nuclear electric field does 
no work on the passing electron. 


e The acceleration a given to the electron varies over the course of a nuclear collision; the 
typical time over which the collision takes place is At ~ 2b/v, and so the frequency of 
the photons emitted is spread over values from zero to fmax ~ v/2b. 


e The maximum photon energy must be less than the initial kinetic energy Ep of the 
electron. 


One very simple law (the Duane-Hunt Law) is that the cut-off frequency above which no 


photons are emitted is 


h 
Next, let’s look at the case of large-enough impact factor b such that |v| ~ |v’|; the electron 
is only slightly deflected by the nucleus and doesn’t change much in energy due to the 
collision. We may then obtain the distance from the nucleus as a function of time as 


r(t) = fe + v2 (6.215) 


where r = b at t = 0 (Fig 6.25). Obviously, for a nuclear charge Q, the acceleration 
experienced by the electron is 
1 eQ 


esa (6.216) 


Me AtrEgr2’ 


(6.214) 


Ve 


so that the emitted power as a function of time is 


e*Q? 1 


P(t)= . 21 
(t) 9673 €8m2c3 (v2t? + b2)? (ee 
The total energy released as photons is 
ee) tQ? 1 

W = P(t)dt = 6.218 
[. (6) 19272e8m2c3 vb3’ ( ) 

since z i 

T 

d= ; 6.219 
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W is measured in joules per electron. 
If we have a beam of electrons,” then each electron will have a different closest distance 
b from a nucleus. Obviously, larger values of b are more probable, with that probability 


*And there are a lot of electrons passing through a radiotherapy target! 
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FIGURE 6.25 In a shallow collision there is a small deflection of the electron, and we can write the 


distance of the electron from the ion as r(t) = Vb? + v2t?. 


x 2rbdb (Fig 6.26). Let’s therefore try to calculate the overall energy emitted per unit 
length traversed by electrons in a target, by integrating over the likelihood of having a 
certain b value. For Ne electrons in the beam, and a number density per unit volume of 
nuclei in the target N;, the energy loss per unit length of target the electrons move through 
is 


bmax 
oF a5 f N.W 2rbdb 
di ; 


min 


(6.220) 
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If we know the numbers of electrons and target nuclei, we can work out the bremsstrahlung 
power; except, what values of bin and bmax Should we use? We have to pick some. Looking 
carefully at our expression for dE/dl, we see that placing an upper limit bmaz = œ is fine; 
electrons can, in principle, travel very far from the nucleus. But what about bmin? If we 
were to allow a very close approach bmin — 0, this would lead our expression to diverge 
and predict an infinite emitted power; clearly this is not okay and must be unphysical. We 
recall that our original assumption that |v| ~ |v’| must break down at small values of b; 
another way of saying this is that the total energy emitted must be less than the initial 
kinetic energy Ep of the electron. If we choose a value for bmin, we may obtain the radiated 
power P = dE/dt for electrons of velocity v as 


N,N-e2Q? 1 
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Notice how v has disappeared from the denominator when we went from writing dE/dl to 
writing Phrem- 


FIGURE 6.26 The likelihood of an impact factor b is x 27bdb. 


However, our calculation of the power is still deficient: we don’t know what bmin to use. 
We can’t easily improve on it unless we do a full quantum calculation (which we won’t do 
here), but we can get an idea by placing a couple of limits on bin; this is equivalent to 
calculating the emitted power up to some maximum photon energy, and is still useful. One 
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limit is to cut off our calculation by saying our integral becomes invalid when Av ~ v. We 
can calculate what this change in v is as 


Qe T b 2Qe 
Av = = t= : 222 
° Me Jt=—00 (vt? + pare" mebu (6 ) 
Hence the limit to apply is 
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Another limit one could apply is when quantum effects become significant, in other words, 


that 
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MeV 
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bmin (6.224) 
In summary, we have tried to do a classical calculation of what is really a quantum process. 
The method is deficient because of the question about what bmin we can use; in practice 
people typically set bmin = h/mv (the quantum limit). This gives 
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Notice something important here: the radiated power strongly depends on the charge of the 
nucleus. We can re-write the bremsstrahlung power in a more convenient form for singly- 
charged ions as 

Prrem ~ 1.85 x 10738 N,N; V Ep Wm, (6.226) 


where Ep is the kinetic energy of the electrons in eV [1]. 

We have found in this discussion that a classical derivation of bremsstrahlung radiation 
power gives us some idea of what is going on — we can obtain the right spectrum and we 
can place limits on the classical integral that give about the right power. In practice a so- 
called Gaunt Factor is used to describe the numerical factor difference between the classical 
calculation we have just done, and the proper quantum calculation. 


Examples of Bremsstrahlung 


Electron bremsstrahlung from X-ray tubes is an instructive example. The photon spectrum 
from an X-ray tube contains contributions from the electron bremsstrahlung of accelerated 
electrons impinging upon the thick (in most cases) anode target, and also lines of X-ray 
transitions of the atoms of the anode material. Here we ignore the X-ray transitions, and 
discuss the bremsstrahlung part of the tube emission spectrum. We saw above in our classical 
calculation that the rate of energy loss into the target dE/dl «x N;Q? (where Q was the 
charge of the ion). The probability of bremsstrahlung emission from a moving charge q of 
energy F is proportional to q? Z?E/m?. Again we see that electrons impacting upon high-Z 
materials will generate more radiation, and electrons generate about (m,/m-)? ~ 3 x 10° 
times more bremsstrahlung than protons do, because they have much less mass. The energy 
loss in a target due to bremsstrahlung is, according to the quantum Thomas-Fermi model, 


dE 
dl 


183 
~ —d4ar? No EZ? In AVES (6.227) 


Hence the bremsstrahlung loss is proportional to energy, and we can write 


ia 7 (6.228) 
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where the radiation length Xo is given by 


lt. 2 2 
a dar? NoZ* In ZI (6.229) 
The radiation thickness is defined as 
A 
Ly = pXo = (6.230) 


4ar2NoZ? In []183/Z1/3]’ 


where p is the target density and A is its atomic mass. 
We already saw that the maximum photon energy was limited by the energy E of the 
incoming electrons. E ~ U where U is the tube voltage, and so the Duane-Hunt law can be 


stated as 
he 


eU’ 
where Athresh is the minimum possible wavelength emitted. The higher the voltage, the 
smaller Arpresh is; we can write this in convenient units as 


Athresh = (6.231) 


Athresh[nm] = 1.239 x 10° a (6.232) 

Another example is so-called free-free emission in a plasma, so called because the elec- 
trons are free to move both before their encounter with an ion and after that encounter; there 
is no capture of the electrons by the ions. Both the ions and electrons in a finite-temperature 
plasma can see (‘encounter’) both ions and electrons and thereby see accelerations and radi- 
ate. However, we know of course that me K Mion, and so the only significant radiation from 
an encounter is when the electrons are accelerated by encounters with the ions — the elec- 
tron acceleration is much larger than the ion acceleration. Hence in a plasma the electrons 
radiate bremsstrahlung and the ions do not; the ions cause acceleration and the electrons 
don’t. Knowing it’s the electrons doing most of the radiating, plasmas behave basically 
the same as electrons passing through a target and we may straightforwardly calculate the 
bremsstrahlung power for the free-free radiation in the same way as we did previously, using 


N.N,e*Q?v? _3 
—_.—.— Wm ”. 


2 
48e8meckh (6233) 


Porem = 


Here, Ne is the density of the free electrons, and N; is the density of the ions. There can 
be more electrons than ions as long as the plasma is neutral overall. Usually, Ne = N; as 
might be expected. 

Our last example involve the tokamaks proposed for fusion power, which typically con- 
tain a large volume of plasma; we wish to maintain a high plasma temperature, but un- 
fortunately the plasma is cooled by bremsstrahlung that occurs as electrons pass close by 
nuclei. An interesting feature is that the bremsstrahlung power Prem x Q?, where Q is the 
ionisation state of the plasma ions. One example of a tokamak is JET, the Joint European 
Torus. This has a plasma volume of around 100 m3. A typical temperature during JET 
operation might be 100 million kelvin (108 K), which corresponds to a kinetic energy of the 
electrons of around 13 keV. The number density of the electrons/ions in the plasma during 
fusion would be around 102° m~? = 10 cm~’. This density should be compared to the 
density of a typical solid material, water, which has pN4/M ~ 10? molecules in a cubic 
centimetre. A ‘high-density’, ‘high-temperature’ plasma is therefore still rather tenuous, and 
contains low-energy electrons. Using these values for the JET tokamak, we predict that the 
lost bremsstrahlung power is quite high: 2 MW. Therefore, it is hard to keep a tokamak 
plasma hot because free-free radiation will cause it to cool itself down. 
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Electron Bremsstrahlung Spectrum 


An important practical situation is that of the bremsstrahlung generated by electrons in 
high-Z targets, for example the generation of X-rays in the head of a radiotherapy machine. 
Already in 1959 Koch and Motz tabulated convenient formulae [21] to estimate the photon 
output as a function of energy and emission angle, later augmented by Berger and Selzer for 
specific metal targets such as tungsten [22]. An example is shown in Fig 6.27. Zschornack’s 
handbook of X-ray data contains a wealth of useful information [23]. In addition, a number 
of codes — including earlier ones such as EGS and more modern ones such as GEANT4 — 
enable the calculation of photon output, normally using Monte-Carlo sampling methods; 
care should be taken when using such codes that sufficient accuracy is used for the cross 
section, geometry, number of primaries simulated and simulation ‘cuts’ that reliable results 
are generated. Alex Bielajew’s numerous publications should be consulted [24]. Nordell 
and Brahme [25] augment the earlier results of Stearns [26], and in particular they give an 
estimate of the angular spread of the photons, 0p, due to the bremsstrahlung process, whose 


RMS value is 
Mec 


Te 
ln 5 
E Me (6 


prms = k (6.234) 
where T, is the electron energy and k ~ 0.26 is an empirical factor derived from measure- 
ments. To the spread in photon angle must be added the angular spread due to electron 
scattering in the target and the initial angular spread of the incident electrons. 
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FIGURE 6.27 Comparison of the energy-resolved photon production from a (thin) 1 mm tungsten 
target, for incident electron energies of 20, 30, 50, and 100 MeV. The solid line shows the Bethe-Heitler 
formula as given in Koch and Motz is used [21], and is compared to Monte-Carlo simulated production 
made with GEANT4. 
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6.6 Cerenkov Radiation 


All the way through this chapter, we have made the statement that an acceleration is 
required to give rise to a radiative component to the electromagnetic field; this is cer- 
tainly true in most situations. However, a uwniformly-moving charge may radiate in certain 
circumstances; one of those is Cerenkov radiation. Cerenkov radiation is the process in 
which radiation is emitted when a uniformly-moving charge is moving in some medium 
with some velocity ve that is greater than the speed of light vp in that medium. We can ex- 
plain Cerenkov radiation* pictorially by considering the electric field exerted by the moving 
charge [27]. 

We first consider a uniformly-moving charge with ve < vp; even though the charge 
is moving, the field lines from that charge still point in straight lines away from it, and 
are still symmetric in strength ahead of and behind the charge. The converse situation — 
corresponding to Cerenkov radiation — is when ve > Up. Now, there is a contradiction as 
the charge is moving faster than the electric field itself can propagate; the charge outruns 
its field lines. We may think of the charge at each point in time as a different source of the 
electric field, and the retarded time as a wavefront that propagates away from the charge. 
We may then apply Huygens’ principle to deduce the behaviour of these wavefronts. We 
see that, for the case here where uv, > vp, an overall wavefront is formed that propagates 
at an angle to the direction of charge motion (Fig 6.28); the individual point sources (from 
the charges ‘emitting’ field at different times) overlap at an angle 0. We can obtain 6 by 
comparing the distance travelled by the charge in a time At, which is v-At = ScAt, to 
the distance travelled by the individual wavefronts, which is v,At = (c/n)At. The overall 
(linear) wavefront — obtained by overlapping the infinitesimal wavefronts from each emitting 
point — is obviously perpendicular to the direction it’s moving, so we can then obtain 6 as 

£At c 1 


cos 0 i (6.235) 


where £c is the velocity of the charge and n is the refractive index of the medium through 
which the charge is moving (Fig 6.29); notice that n may vary with wavelength, so that 
different wavelengths may be emitted at different angles. 

If we assume that the charge is moving with a large kinetic energy and therefore large 
velocity (and that is very often the case), we may state 8 ~ 1 and our Cerenkov angle takes 
a very simple form: 


1 
cos ~ —. (6.236) 
n 


For example, water has a refractive index of about 1.33 for visible wavelengths. Hence the 
Cerenkov angle is 0 ~ 41°. Notice that Cerenkov radiation is emitted when any charge 
is moving through a material with ve > vp, but we will only see that radiation if the 
material itself is transparent to it. Also, the Cerenkov radiation is emitted at 41° at any 
azimuth around the direction of the charge — the radiation is emitted as a cone. Slower- 
moving particles (3 < 1) give rise to a radiation cone which is narrower, and obviously the 
minimum velocity where Cerenkov radiation is produced is when ve = Up, in other words 
when 3 = 1/n. In water, charged particles have to move faster than 8 = 0.75 to generate 


*Pavel Cerenkov (the surname is pronounced ‘Cherenkov’) was a most interesting Russian scientist, 
and also said to be the inspiration for the Star Trek character Pavel Chekov. As a doctoral student 
Cerenkov studied under Sergey Vavilov, another notable Russian physicist known in radiation physics 
for his description of energy loss of charged particles, and also as the co-discoverer with Cerenkov of 
what is now known as Cerenkov radiation. 
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FIGURE 6.28 An electron moving to the right at a velocity Ue > Up, pictured at successive time 
intervals; the spherical fronts corresponding to different retarded times are shown. The combination of 
wavefronts as the charge moves gives rise to a conical Cerenkov wavefront whose normal is an angle 0 to 


the direction of charge motion. 


vpAt = Ent Emitted wavefront 
n 


vAt = BcAt 


FIGURE 6.29 Geometry of emitted wavefront in Cerenkov radiation. 


Cerenkov radiation (Fig 6.30). Note that we haven’t said what kind of charged particle can 
do this — any charge can generate Cerenkov radiation. However, usually it’s electrons that 
we talk about since they are the most common situation. 

If we can measure the angle 0 of the Cerenkov cone and we know the refractive index 
n, we can measure the velocity ve of the charge. Since a charged particle slows down in a 
transparent material (such as water) by means of ionisation slowing, there is thus a ‘ring’ 
of Cerenkov light emitted for the time the charge’s velocity remains larger than Up. A so- 
called ring-imaging detector (for example the RICH (Ring-Imaging CHerenkov) detector) 
can then determine 0 and combined with a separate measurement of the momentum pe of 
the charge (by measuring the deflection angle caused by a dipole field B), we can determine 
the mass of the charge using pe = Meve — a useful process in particle physics experiments. 

Pavel Cerenkov first observed the radiation named after him when observing the blue 
glow in a bottle of water caused by radioactive particles travelling rapidly through it. The 
interesting part of that statement is the glow was blue, not white. Why is Cerenkov radia- 
tion blue? The reason is that, whilst photons of many different frequencies are generated, 
more blue photons than other visible frequencies are generated. Ilya Frank and Igor Tamm 
obtained a description of this behaviour.* The basic Frank-Tamm formula describing the 


*Pavel Cerenkov, Ilya Frank and Igor Tamm were jointly awarded a Nobel prize for the discovery and 
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FIGURE 6.30 Water has a refractive index of about n = 1.33 for visible light. The minimum velocity 
where Cerenkov radiation can be generated is 8 = 1/ n. Ultra-relativistic particles with 8 ~ 1 have a 


Cerenkov angle of 0 = cos !(1/n) ~ 41°. 


number of photons dN liberated over a given frequency range dw is given by 


= q — sinf 0 (6.237) 


over a distance dz, where 0 is the Cerenkov angle. Converting the number of photons into 
their energies, we may obtain the energy lost at different frequencies as 


dE _ 2 Ho 1 
de 1 (1 z7) dw. (6.238) 


This explains why more intensity is produced at blue wavelengths than red wavelengths — 
Cerenkov light is blue. However, consider the simple case of a constant refractive index n 
(i.e. constant for all frequencies w). The total energy emitted into Cerenkov light becomes 


dE eu 1 a 
— = — |1- dw. 2. 
haa Bs) (6209) 


The integral fa wdw diverges. Hence there must be a maximum frequency of emission when 
n becomes less than 1. High-frequency (short wavelength) radiation above the UV range 
typically has n < 1 (n later rises to around n = 1 for X-rays and gamma rays), so the finite 
amount of energy emitted into Cerenkov radiation is basically limited by the fact that the 
refractive index varies (this is illustrated in Fig 6.31). 


description of Cerenkov radiation. 
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FIGURE 6.31 Variation of permittivity €r = n? with frequency w in a dielectric material such as water. 
If the Cerenkov condition were satisfied at all radiation frequencies then an infinite amount of Cerenkov 
radiation light would be emitted. However, the Cerenkov condition is only satisfied for a narrow range of 
frequencies so a finite amount of radiation is emitted. The condition is typically satisfied in the visible part 
of the spectrum, and the light output is more blue than white because of the frequency dependence of the 
Cerenkov intensity; this is why Cerenkov light in a nuclear reactor appears to be blue. 


Exercises 


1. The United Kingdom JET tokamak utilises a toroidal field system in which the toroidal 
coils have an aperture around 5.5 m in height and 4 m in width; the outer diameter of 
the toroid is 10 m. Estimate the stored energy in the toroid if it generates a magnetic 
field of 3.45 T. 


2. For a plane electromagnetic wave, show that the real part of the time-averaged Poynting 
vector is 5 
E, 
S = rms . 
(8) = e 
3. A focused laser pulse generates a power of 27 TW over a circular area of radius 0.1 mm. 
Calculate the RMS electric and magnetic fields at the focus. 


4. A Hertzian dipole antenna with length 1 cm radiates with a power of 100 W at 100 MHz. 
Find the amplitude of the alternating current fed to the antenna. Determine the mag- 
nitudes of the electric and magnetic fields at points a distance of 100 M away, (i) along 
that antenna axis and (ii) perpendicular to the antenna axis. 


5. A proton is accelerated by a potential difference of 700 kV in a static electric field, over a 
distance of 3 m. Obtain an expression for the ratio of radiated energy to the final kinetic 
energy, and hence show that radiation losses are negligible. 


6. Consider an isochronous cyclotron that produces protons at its extraction point with an 
energy of 20 MeV and an average current of 1 mA; the average field at the extraction 
radius is 1.8 T. What is the emitted cyclotron radiation power for the outermost turn, 
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10. 


11. 


12. 


13. 


and what frequency does it have? Describe qualitatively the pattern of emitted radiation 
and its polarisation. 


For the same cyclotron as the previous problem, now calculate the total cyclotron power, 
assuming the voltage gain per turn is 20 kV. 


A non-relativistic charged particle orbits in a uniform magnetic field. Defining the energy 


decay time as 
du\~* 
— U — 
n 


(where U is the charge’s energy), show that r is given by 


3nreo m? 


e4 B? 
If the magnetic field strength is B = 2 T, calculate the energy decay time due to cyclotron 
radiation both for a proton and for an electron. 


Consider synchrotron radiation from a highly-relativistic electron gyrating with radius p 
in a magnetic field B. Let AO be the angular width of the emitted radiation as seen by 
a (stationary) observer and 7 its duration. Obtain an expression for T’, the time interval 
of the radiation in the reference frame of the electron, and hence show that 


RAO 1 
cy 


2\ 2 
Me {Mel 
T= = i 
eB ( E ) 
where E is the electron energy. This pulse length determines the maximum frequency of 


emitted synchrotron radiation. For electrons of 3 GeV energy moving perpendicular to 
the magnetic field of 0.8 T, estimate the associated maximum emitted photon energy. 


‘cy 


Writing A90 = 1/7, show that 


Consider an electron-positron collider with an energy in each beam of 2000 GeV per 
particle. For a 100 km tunnel length and a dipole field of 0.1 T, estimate the fractional 
energy loss per turn an electron undergoes. Relate this to the likely momentum acceptance 
in such a ring and thereby estimate the minimum number of cavities needed; assume that 
the ring lattice is tuned for 2000 Gev at all positions in the ring. 


The Daresbury Synchrotron Radiation Source generated photons from a circulating elec- 
tron current of 200 mA and an average electron energy of 2 GeV. Given that the ring dipole 
bending radius was 5.56 m, show that the on-axis emitted power was 20.8 W/mrad?. A 
6 T wavelength shifter (WS) was used to increase the short-wavelength flux of photons. 
Calculate the ordinary dipole and WS critical energies, and show that the spectral photon 
flux at 30 keV is increased in the WS by about a factor of 100. 


Consider a 5 GeV storage ring with a circulating electron current of 100 mA. A 4 metre- 
long undulator with K = 1 is installed in one of the ring straight sections; calculate its 
bandwidth and the number of photons emitted in the first harmonic. If six such undulators 
are installed, estimate the fractional increase in energy loss per turn Up. 


A 1 kW beam of monochromatic, 10 keV X-rays is incident upon the end of a column 
of fully-ionised hydrogen plasma in which the number density is 1016 cm~%; the plasma 
column has a cross-sectional area of 1 cm? and is 10 cm long. Estimate the X-ray power 
scattered by the plasma column, and the wavelength of the scattered radiation. 
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14. 


15. 


A laser of wavelength 1 um is scattered from an electron beam to generate 50 keV photons. 
Assuming the ‘head-on’ geometry in which the scattered photon direction is at 180 degrees 
to the incident photon direction, what is the required mean electron energy? What is the 
bandwidth of the 180-degree scattered photons if the electron energy spread is 0.1%? 
For a laser pulse energy of 1 uJ, an electron bunch charge of 100 pC, and an interaction 
repetition rate of 10 Hz, estimate the rate of 50 keV photon production if both the laser 
and electron beams are focused to a size of 10 um at their interaction point. 


A 4 GeV storage ring is designed to have a so-called ‘theoretical minimum emittance’ 
(TME) lattice such that f dipole Ma (s)ds has its minimum possible value in each dipole; 
there are 40 dipoles in the ring. By minimising the value of Tsx in each dipole (by varying 
the values of nz, n4, Bx and a at the dipole entrance, and assuming that the dipoles are 
all the same), show that the natural emittance may be given by 


1 
ezo = = 0.476". 
o= vi 


where @ is the bend angle in the electrons created by each dipole magnet. From this, 
estimate the natural emittance and (assuming that Z4, ~ 0) the equilibrium energy 
spread. 
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In the preceding chapters we described the three most important aspects of the behaviour 
of charged particles in accelerators and the devices associated with them: the effects of 
electric fields upon charges; the effects of magnetic fields upon charges; how the charges 
produce electromagnetic radiation. In those chapters each charge was considered to be 
moving independently of the other charges that may accompany it; the effect of having 
many charges moving together in a bunch being merely additive. For example, the beam 
loading in a cavity is proportional to the amount of passing charge, as is the intensity of 
synchrotron radiation. Moreover, we ignored that the moving charges might influence each 
other. This is true in situations where the bunch charge is sufficiently low. However, there are 
many circumstances where we must take account of the beam intensity — expressed either 
in terms of the bunch charge (or current) or in terms of the bunch density (which depends 
upon the bunch volume). There are a variety of phenomena that can manifest themselves — 
too many for the scope of this book — so in this chapter we describe the principles underlying 
them, and give a few of the most important examples that the reader may encounter. We 
divide our discussion of these self-fields in terms of i) the effect of moving charges upon each 
other (intra-beam forces, space charge and scattering), ii) The effect of bunches upon the 
vacuum system and the consequent effect onto the bunches (wakefields and instabilities) 
and iii) the enhancement of radiation by coherent effects. 
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7.1 Intra-Beam Forces and Space Charge 


We begin by recalling that a slowly-moving (i.e. non-relativistic) charge exerts both an 
electric and magnetic field; we consider now two charges moving together at the same 
velocity v and separated by a distance r (where v L r). The electric force seen by each 
charge (due to the other) is Fg = —e?/4reor? and does not depend on v. The magnetic 
force varies with velocity; the magnetic field experienced by one charge due to the motion 
of the other is B = poqu/4rr?, and the force Fg = quB = v? uoB/4rr?. When v > c 
we have Fg + Fg = 0; the forces cancel*. We may state this equivalently in terms of time 
dilation: in the rest frame of the charges there is only an electrostatic force, but when moving 
at v with respect to a (stationary) observer the overall Lorentz force F = qE../y where Ee 
is the electric field experienced by one charge due to the other in their mutual rest frame. 
We see therefore that slowly-moving bunches will experience a mutual repulsion, known as 
space charge that becomes much less strong as the average bunch velocity approaches c. We 
expect also that the strength of the space charge force is greater if we have more charges 
(i.e. more particles) in the bunch. 

The self-forces within a bunch give rise to a number of phenomena, many of them 
unwanted. These include defocusing (leading to a change in the betatron tunes), energy loss, 
and so on. It is conventional to divide the space-charge forces into collisional interactions 
— those in which particles collide individually — and the overall smooth space-charge force. 
The boundary between these regimes is given by the Debye length, which describes the 
distance over the field of a single particle is screened. The Debye length is 


2 
a (7.1) 

Wp 
where (v?) is the average (thermal) RMS velocity of each charge and wp = \/e?n/meo is the 
plasma frequency for unit-charge particles with a number density n. If the whole bunch is 
moving with some energy and relativistic factor y and we have an (equilibrium) distribution 
of velocities (i.e. a Maxwell-Boltzmann distribution), then the Debye length can be given 


as 
eg ykply 
= ,/ 2B. 2 
AD en ’ (7 ) 


Ty is the RMS thermal temperature of the bunch charges with respect to their average 
velocity such that ymõ? = kgT,/y. If the Debye length is large compared to the bunch 
size (radius) then individual collisions will dominate; if the Debye length is small then 
collective, smooth forces will dominate. For example, a typical relativistic electron bunch 
with radius 100 um and length 6 ps may have an effective temperature kyT, = 0.2 eV, for 
which Ap œ 5 um but an inter-particle distance of, say, less than 1 um. In most situations 
the collisional forces are therefore small compared to the smoothed forces, except that it is 
the collisions that lead to there being an equilibrium ‘thermal’ beam distribution and that 
also contribute to there being an equilibrium beam size. We now consider some practical 
examples. 


7.1.1 Space-Charge Forces 


In a rapidly-moving bunch (y > 1) the electric field from any given charge is only felt in 
a plane that is co-moving with that charge. Hence we may calculate the space-charge force 


*This is still true when we include the effects of length contraction on the electric and magnetic fields. 
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on a particle in a bunch by considering only the two-dimensional charge distribution at 
the same z location as that particle; often the distribution will be Gaussian in all three 
dimensions x, y, and z and we can approximate the electric field seen by a given (‘test’) 
charge as 
er x er y 
Ez ~ , Lym 7.3 
T Ineo Oz(Ox + Fy) ”— Ineo Cy(Ox + Fy) n 


here, eA is the longitudinal charge density and oz, oy, are the 1-ø sizes of the bunch in the x 
and y dimensions. The force on each charge in the bunch appears as an effective defocusing 
effect in both planes; one consequence is that the overall betatron tune is reduced (really, a 
tune spread is induced and hence this effect is called an incoherent tune shift). In a circular 
accelerator the tune shift (for example in the vertical plane) is 


2rer 
B?Boy(on + dy)’ 


1 
Avy = ae f Bukyas. ky = (7.4) 
where here 8, is the vertical 6-function (integrated around the ring) and 6 = v/c. Hence 
for N particles in a Gaussian bunch of length o, we may re-cast this tune shift as 


O 2reN By 
Aw = A f eer a 


As one would expect, the tune shift is proportional to the bunch charge (eN), and is larger 
for smaller-sized bunches. Also, since oy is often much smaller than o, the vertical tune 
shift is generally more important (hence why the formulae were given for Av,). Since the 
effect is to give a tune spread rather than a single tune change for all particles, it cannot be 
compensated by changing the strengths of the magnetic lattice quadrupoles; at a sufficiently- 
large value of Av particles will be driven onto resonances and be lost, causing a beam lifetime 
reduction. As an example, we consider one design of the TESLA 5 GeV, 17 km long, damping 
ring [1]; here the final emittance after damping is ye, = 9 um, yey = 2 wm with N = 2x 10!° 
a bunch length of 6 mm. The tune shifts are Av, ~ —0.02, Avy ~ —0.3. Avy is very large; 
to reduce the tune shift one must do one or more of: decrease the circumference; increase 
the electron energy; increase the (specified) emittance; or decrease the circulating bunch 
charge. 


7.1.2 Space-Charge Dominated Beams 


An intense beam generates mutual repulsion in a moving bunch that gives rise to an effective 
defocusing force between the charges; in general this force is nonlinear. However, there is a 
special (transverse) distribution, known as the Kapchinsky-Vladimirksy (KV) distribution, 
for which this space-charge force is linear [2]. Particles with a KV distribution are uniformly- 
distributed in any two phase-space coordinates (x, 2’, y, y’) such that the RMS size of the 
beam is exactly half the actual beam radius. Since the space-charge force is linear, its effect 
on any particle may be determined by modifying the ordinary Hill’s equations to give 


iE 21 £ 
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where 6,, Õy are the beam extents in each plane (6, # 6, means of course that the beam 
is tranversely elliptical). J = NG is the beam current at any point along the bunch for a 
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(linear) bunch density N, and I, is the Alfvén current, which for electrons is 


Mec? 


Ia = 4reo ~ 17.045 kA. (7.8) 
Bunches with a KV distribution are said to be stably transported [3] while other distri- 
butions can tend to a KV distribution; however, this is a complex topic and the reader is 
referred in particular to Lund’s review [4], earlier work by Hoffman et al. [5] and Reiser’s 
more specialist text [6]. 

The envelope equations (known as the KV equations) for the transport of a KV distri- 
bution can be found as 


2K sc a 

öll + keða t E M, (7.9) 
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where the normalised space-charge perveance Kse = 2Nr./6?y? is equivalent to the ratio 
I/B%y°1a4 (we have here defined esy = 57,,/82,y, SO that €2,y = 4€erms,yRMs)- The three 
focusing terms are from the lattice (terms in k, and ky), from space-charge defocusing 
(terms in Ksc, a perveance effect), and from so-called thermal defocusing (terms in €z,,,). A 
beam is said to be laminar if 


Ex 21 
B, < EEIN (7.11) 
Defining a laminarity parameter 
1 I o? 
P= Tare, ai 


for a normalised emittance €y, = Y€x and (RMS) beam size og, the condition p >> 1 means 
that a beam is space-charge dominated; particles move on trajectories that do not cross and 
the emittance will grow. p < 1 is the condition for a so-called thermal beam where space- 
charge forces can be neglected, and in which individual particle trajectories do cross. As a 
bunch is accelerated, for example in a linac, p gradually reduces and there is a transition 
energy with (relativistic) y 


~=5 (7.13) 


above which the beam changes from being laminar to being thermal. A typical example 
might be an electron bunch driving an X-ray free-electron laser, for which ~kA peak currents 
are required to obtain laser gain. Taking J = 1 kA, €2, = 1 mm-mrad and a beam size 
of oz = 300 um, the transition energy is 1350 MeV. Hence, space-charge forces can be 
significant in electron linacs driving free-electron lasers since they utilise electron bunches 
with these sorts of parameters. 


7.1.3 Emittance Growth and Compensation 


Particle bunches typically do not have ideal (KV) transverse distributions, but are often 
Gaussian both transversely and longitudinally; having a Gaussian longitudinal distribution 
(i.e. in the s direction) means that the current J varies from one end of the bunch to 
the other. The consequence of this is that the space-charge force also varies, for different 
longitudinal slices through the bunch. We saw above that for certain distributions the 
space-charge force is linear — it looks like a focusing term — and hence can be reversed by 
a suitable (external) focusing force; in other words, the correlated emittance effect can be 
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compensated for using quadrupoles. However, this is only true within a single slice of the 
bunch; the centre of a bunch will have a larger defocusing than the bunch ends and there 
will be an uncorrelated phase space dilution for the bunch as a whole. In some circumstances 
this can be alleviated by instead using solenoidal fields in a technique known as emittance 
compensation, first described in 1989 by Carlsten [7] and later developed by others [8, 9]. 

Practically, the estimation of space-charge effects is carried out using one of many avail- 
able codes that take inter-particle forces into account. These codes loosely fall into one 
of three categories, depending on the number of dimensions that are used to describe the 
particle density within the bunch. One-dimensional codes such as HOMDYN [10] treat a bunch 
as independent longitudinal slices, and within each slice effectively use a single (circular) 
size and density which is used to evolve the particle distribution using an approximation to 
the envelope equations; a bunch is followed (‘tracked’) incrementally in small steps along 
s (or in time t) through an accelerator lattice to obtain an estimate of the resulting parti- 
cle distribution. Two-dimensional algorithms such as in ASTRA [11] divide each longitudinal 
slice further into concentric, cylindrically-symmetric rings with varying charge density. Fully 
three-dimensional algorithms follow a reduced but representative subset of particles — known 
as macroparticles — and calculate their individual motion due to the integrated effect of all 
the others; GPT [12, 13] is a widely-used example of such a code. In these codes, the electric 
field is generally calculated by solving Poisson’s equation to obtain the fields on a finite 
mesh of points (a particle-in-cell method) [14]. As the number of dimensions used increases 
there are more inter-particle calculations to be performed each step; this was the origi- 
nal motivation for using lower-dimension approximations and for using macroparticles for 
three-dimensional simulations. Limborg et al. have studied the relative accuracy and speeds 
of a number of codes [15], and Neveu at al. have performed a more recent analysis [16]. 
Today, fast space-charge solvers and parallelisations of several codes exist (for example 
OPAL [17] and IMPACT-T [18, 19]) that allow more accurate calculations to be performed in 
a reasonable time. 


7.2 Scattering Processes 


7.2.1 Intrabeam Scattering 


Intrabeam scattering (IBS) is a collective effect that occurs within particle bunches; multiple 
Coulomb (elastic) scattering events occur between pairs of electrons, giving rise to a transfer 
of energy between them; this process was originally called multiple Touschek scattering (see 
below). The process can be thought of in terms of there being an effective temperature in 
each of the x, y, s bunch directions due to the different particle momenta, and IBS gives rise 
to an equilibrium being formed between those directions; in addition, there is net energy 
being given to the particles from the RF acceleration and in the case of electrons there 
is also radiation damping. Thus, for hadron (e.g. proton) beams we expect a steady and 
unbounded growth of the beam emittance — so that we must keep the IBS growth rate 
small — and for electrons we expect an equilibrium emittance to be obtained which is larger 
than the natural (i.e. ‘zero-current’) emittance. Originally formulated by Piwinski [20] and 
developed by Bjorken and Mtingwa [21], there are today also more convenient approximate 
formulae from Kubo and Wolski [22], or from Bane [23]. In an electron storage ring, we 
start by writing an evolution of the emittance with time as 


dex 1 2 
dt 7, T a0) + Fe (7.14) 


where Ty is the growth time due to IBS and 7, is the synchrotron radiation damping time 
giving the natural emittance €,9 (see Chapter 6); similar expressions may be written for the 
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y and z directions. The equilibrium emittance including IBS, e% is given by 


T; 
—— 7 aö: .1 
€ T7, (7.15) 


The general form of the growth rate (1/Tu in each plane u = x,y, s) is 
1 
~ 4T A(log) (J fule D) ; (7.16) 


where () denotes an average of fu around the ring lattice (fu being a function of the lattice 
and bunch parameters), (log) is the co-called Coulomb logarithm and 
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here, 8 = u/c, y is the relativistic factor, os the energy spread, and gs is the bunch length. 
Immediately we see that the scaling œ 1/7* means that IBS is only relevant at lower 
electron energies, which for practical purposes in electron storage rings is around 3 GeV 
(or perhaps somewhat higher for some damping ring designs). The Coulomb logarithm 
describes the ratio between the maximum and minimum impact parameters relevant for an 
electron-electron collision, and it is conventional to use the classical electron radius re as 
the minimum and the vertical beam size as the maximum. For example, taking a modern 
3rd-generation ring we might have ey ~ 10 pm and (6y) ~ 5 m, so that 


V5 x =) a 


Te 


(log) = In | (7.18) 
This is the original value assumed by Bjorken and Mtingwa in their analysis. However, there 
is no clear consensus as to the correct value of (log) and values between 10 and 20 have been 
proposed; similarly, modifications to account for the non-Gaussian nature of some beams 
have also been proposed. Hence, calculated IBS growth rates should be checked against 


measured values as has been done in the Japanese ATF ring. A useful approximation to 
IBS is given by the CIMP (Completely Integrated Modified Piwinski) formulae [22], which 
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and the scaled bunch dimensions 


zae s=% |b (7.23) 
o Ex 


These are fairly involved expressions, and there are established (and validated) codes such 
as elegant [24] that will determine IBS growth rates. To give an idea of the effect of IBS, 
consider again the 3 GeV, 561.6 m Diamond storage ring with a coupling (i.e. emittance 
ratio) x = 0.01. With czo = 2746 nm-rad, and 936 bunches with zero-current bunch length 
050 = 3.83 mm (13 ps), with 300 mA circulating current there is only a very small increase 
in emittance from IBS (2754 nm-rad). Conversely, one multi-bend achromat design gives 
€x0 = 44 pm-rad; even with a larger k = 0.1 to reduce the growth rate the equilibrium 
emittance grows to 79 pm-rad — an increase of 80%, with similar increases in the vertical 
emittance and bunch length. Whilst IBS is predominantly considered for circular machines, 
it may also occur in sufficiently-long linacs when the electron emittance is small [25]. 


7.2.2  Touschek Scattering 


Touschek scattering (first explained in 1963 by Bruno Touschek and collaborators [26]) is 
related to intrabeam scattering, but here we are concerned with those scattering events in 
which an appreciable transfer of momentum occurs between two particles — perhaps 1% 
of the momentum or more. After such a scattering event, one particle has an energy +Ap 
above the mean bunch energy and the other has an energy —Ap below (both Ap values are 
the same magnitude). Again we consider here an electron ring, and in this case the initial 
momentum change from the scattering will persist for a time ~ 7, before the electrons 
damp back into the ‘core’ of the bunch; there is therefore a diffuse ‘halo’ of Touschek- 
scattered particles, with density above that from the synchrotron radiation, continuously 
being excited and damped. However, the RF accelerating voltage gives a bounded energy 
acceptance err that is typically between 1% and 3% (see Chapter 6); electrons that are 
Touschek-scattered outside this limit are lost, and this leads to a finite beam lifetime given 
by 
1 1 dN r2cNp 1 
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where Gg y,s are the bunch dimensions (in metres), € is a scaled parameter defined as 
2 
€ 
ae ( ante ) . (7.25) 
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and the function D(e) is defined as 
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This function is shown in Fig 7.1, and arises from consideration of the electron-electron 
(Möller) scattering rate into a given energy deviation (see for example the detailed derivation 
by Le Duff [27]). Often we have « < 1, and hence the lifetime 7 œ «ĝp. In modern-day 
electron storage rings and damping rings the Touschek lifetime is often measured in terms 
of hours and is the shortest beam lifetime encountered; Touschek scattering determines 
how long the beam can survive for. One may increase the RF voltage to increase erp, 
and generally despite the bunch shortening caused by the higher voltage the Touschek 
lifetime is increased. For a large enough voltage the RF acceptance becomes larger than the 
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FIGURE 7.1 Behaviour of the function D(e). 


energy acceptance given by other limitations; these are the physical momentum acceptance 
— since the Touschek-scattered particles move laterally because of dispersion such that Ax = 
na(Ap/p) — and the dynamic momentum acceptance. The dynamic momentum acceptance 
is here defined as the largest momentum deviation that remains stable (commonly found to 
be no more than 3 — 4%), and the limitation arises because of non-linear effects upon the 
electron motion (see Chapter 5 for a fuller discussion of dynamic aperture and momentum 
acceptance). 

A number of approximations have been used to determine the Touschek lifetime, starting 
from Briick’s [28] and including the method of Völkel [29]; some older codes such as ZAP use 
these approximations [30]. These may be used in certain circumstances depending mainly 
upon the relative horizontal and vertical beam velocities with respect to the particle energy, 
and the reader is advised to check the limits of validity for those approximations before using 
such calculations. More modern codes such as elegant exist that are generally reliable across 
different electron parameter regimes. 


7.3 Wakefields 


As we have just seen, a charged particle moving at v = c through a vacuum — or equivalently 
moving through a smooth beam pipe whose walls have zero resistance — does not see the 
fields generated by other particles unless they have the same longitudinal coordinate s; this 
is because the electromagnetic field takes the form of a thin disk travelling with the particle. 
However, a finite-conductivity vacuum boundary near the bunch contains charges that will 
be attracted under the effect of the passing charge’s field, creating an image current. Since 
the image current will dissipate energy due to the finite conductivity, there is therefore 
energy loss from the charge and the beampipe acts effectively as an impedance to slow the 
charge. Due to the retarded time T = t— r/c between when the charge passes and when the 
charge’s field is seen at the beam pipe (for a distance r between charge and pipe boundary) 
there will remain an electromagnetic field in the wake of the passing charge. This wakefield 
may then act back upon charges that follow the first; high-velocity charges cannot exert a 
wakefield in front of them — a form of the principle of causality. We introduced and briefly 
discussed wakefields in Chapter 3 from an RF cavity perspective and now we discuss it in 
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more detail. 

When a particle bunch reaches a beampipe discontinuity — such as an insulating gap 
or a cross-section change such as a cavity — the (free) particles in the bunch continue to 
travel unimpeded, but the image current must go around the discontinuity through the 
conduction path. There will be a decelerating force on the bunch as it moves away from 
the image charge/current, and the beam will lose energy in the form of radiation to the 
electromagnetic fields driven by these surface currents and charges. This radiation will 
remain and can interact with later bunches, or indeed later charges within the same bunch 
and can in some cases cause significant beam disruption. Energy in unwanted modes can 
adversely affect the shape and trajectory of later bunches in a bunch train through beam- 
beam instabilities. If the wakefield from the head (front) of a bunch interacts with the 
particles in the tail of the same bunch, we call it an intra-bunch, or short-range, wakefield; 
if the wakefield from one bunch interacts with a later bunch, it is referred to as an inter- 
bunch, or long-range, wakefield. 

If we have a beam propagating in a perfectly-conducting beampipe of constant cross 
section, the phase velocity of all the waveguide modes of the beampipe are greater than the 
speed of light at all frequencies and there is no power loss from the beam’s space-charge 
fields to the perfectly conducing walls due to the image charge, so there is no net interaction 
of these modes with the beam. Hence an ultra-relativistic bunch (y >> 1) in a perfectly- 
conducting smooth-walled beampipe generates no wakefield; to induce a wakefield, one of 
two things must occur: either there is an obstacle to reflect the fields to slow down the phase 
velocity /localise the fields (known as a geometric wake), or there is a finite conductivity 
causing the image current to lose energy (known as resistive-wall wake). 

In geometric wakes, a discontinuity scatters the field. However the discontinuity needn’t 
only be a cavity: it may be a coupler, a corrugation (such as a vacuum bellows), a surface 
imperfection (small features such as flanges, diagnostic instrumentation or pumping ports), 
surface roughness, or any similar change in the beampipe’s otherwise constant cross section. 
Where the beampipe or cavity cross section is reduced or increased, the wakefield is strongly 
dependent on the smallest aperture size. Typically the aperture size is proportional to the 
wavelength of an RF cavity, hence higher-frequency cavities tend to have higher wakefields. 
In resistive walls the longitudinal wavenumber, k,, becomes complex due to the finite con- 
ductivity of the walls, causing a transfer of energy between the particle and the EM wave. 


A driving charge q’ traversing a discontinuity induces a wakefield that will persist for 
some time. At a later time, a test charge q, a distance s behind the driving charge, will 
experience that wakefield. The wake function w,(s) describes the effect of the driving charge 
on the test charge, where w, is the voltage per unit drive charge as a function of the 
distance between the two particles, given as a superposition of the voltage from all modes 
in the system. If we consider a particle trailing the driving charge by a distance, s, and we 
integrate the electric field, E,, seen by that particle along the beam path in z — divided by 
the driving charge — we obtain the wakefield 


1 L 
w(i) =- f EGéaeea\ as (7.27) 


where q is the charge of the driving bunch, and the field extends for a distance, L. The 
wakefield is normally stated in V/nC or V/pC. It is often more useful to represent the 
wakefields in the frequency domain. This is known as the coupling impedance and is the re- 
lationship between the decelerating voltage and the beam current. The coupling impedance, 
Z| is calculated by performing a Fourier transform on the wake potential, and is measured 
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in Ohms [31]; we have 
Z| (w) = | Wz exp (—iwt)dt. (7.28) 


—6o 
For an RF cavity or other discontinuity with an electromagnetic resonance at some frequency 
(where there are two discontinuities between which radiation can reflect), the impedance 
plot has 3 distinct regimes, namely i) cut-off, ii) narrowband impedance, and iii) broadband 
impedance. This is illustrated in Fig 7.2. 

Below the beampipe cut-off frequency the impedance is close to zero as there are no 
propagating modes with a real k,, although there is a small impedance due to the band- 
width of modes above cut-off extending to lower frequencies and the evanescent modes. 
The resonant discontinuity is connected to the rest of the accelerator by a beampipe, which 
is normally a circular waveguide. Below the cut-off frequency of the beampipe any modes 
which are resonant are trapped and only exist as certain frequencies. Each of these modes 
is a narrowband impedance as it has a small bandwidth and the modes do not overlap 
greatly. In between the resonances we see regions that have very low impedance over a very 
narrow band. These are known as Fano resonances, where the reactance of a mode above 
its resonant frequency cancels out the reactance of a mode below its resonant frequency. 
Above the cut-off of the beam-pipe the modes are travelling waves and can propagate out 
of the system through the beampipes. If the bandwidth of each mode is very large, at any 
given frequency the fields will be a superposition of several modes. This causes a continuous 
broad impedance spectrum. The beam may interact with the beam at the discontinuity or 
elsewhere in the machine if the radiation has propagated away. 

The narrowband regime is normally treated in the frequency domain. The impedance 
can also be given from the equivalent circuit impedance as seen in Chapter 3, 


Rs circut 
Z= ee . (7.29) 
tty (2 = s) 


where Rs is the shunt impedance of the cavity, and Qz is the loaded Q factor. If we transform 
the impedance back into the time domain, we get a wave that is described by 


woR aj 1 1 1 
w(t) = —s COS Wo, / 1 t sinwo,/l1——,t], (7.30) 
2Q V Qt rua Qi 
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where 7 is the decay time given as 2Qz/wo, and Qz is the loaded Q factor of the cavity. 
The damping causes a phase shift between the beam and the excited RF wave, as well as a 
small shift in the wake’s frequency from the resonant frequency of the mode, wo. Normally 
the Q factor is sufficiently large that this reduces to 


w(t) = —s~*/7 (cos wot) . (7.31) 


It is commont to define the loss parameter, k, for each mode to quantify the strength of 
interaction where, 


wR 
k= —. 7.32 
e (7.32) 
The effect of the beam’s longitudinal profile on the wakefield is found by multiplying 
the frequency spectrum of the beam by the impedance, which can then be converted back 
to a wake potential by using an inverse Fourier transform. The beam spectrum, (w), for a 
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FIGURE 7.2 Illustration of the behaviour of coupling impedance as a function of frequency, showing 
typical impedance spectrum in the cut-off, narrowband and broadband impedance regimes. 


bunch of charge q with a Gaussian charge density distribution in space, is also a Gaussian 
in the frequency domain and is given by 


wo? 


w Oz 
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I(w) = qexp — (7.33) 


where o, is the (Gaussian) standard deviation of the bunch in the z direction. 


7.3.1 Short-Range Wakefields 


In order to obtain the wakefield of an entire bunch, W,, it is necessary to convolute the 
wakefield of a single point charge, wz, with the charge density of the bunch. 


W.(s) = a wz(z)àg(s — z)dz (7.34) 


where Aq is the longitudinal charge density profile of the bunch. 

The total energy lost by a bunch due to its own wakefield is given by the loss factor 
(AK), in electron-volts per unit drive charge, which integrates the wakefield multiplied by 
the charge density along the length of the interaction 


1 co 
AK = -- | W.(2)Aq(z)dz. (7.35) 
q J-co 
Most bunches do not have Gaussian distributions, instead having ripple in the charge spec- 


trum, but it is taken as the standard distribution for wakefield calculations due to its smooth 
roll-off with frequency. If the beam has a Gaussian spectrum then the standard deviation 
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FIGURE 7.3 The short-range wakefield for a Gaussian bunch with 0, = 0.3 mm from a 9-cell RF cavity 
with a 30 mm aperture along with the line charge density. The head of the bunch is to the left of the plot. 


will be inversely proportional to the beam’s time duration. Long-range wakefields are dom- 
inated by the narrow-band impedance region as the broadband impedance quickly decays 
as the energy propagates down the beampipe. For short bunches, which hence have a large 
spectrum the short-range wakefield is dominated by the broadband region, as while the 
impedance is lower than the narrowband region it extends over a wide frequency spectrum 
for short bunches. The short-range wakefield for a Gaussian bunch with o, = 0.3 mm from 
a 9-cell RF cavity with a 30 mm aperture is shown in Fig 7.3. The electrons at the head 
of the bunch do not see any deceleration due to the wakefield, while electrons later see the 
full wakefield. For very long bunches, the higher-frequency components in the wake may 
oscillate causing the electrons in the tail to be accelerated. 


7.3.2 Long-Range Wakefields 


If we integrate the real part of the impedance in frequency around a mode and divide by 
the resonant frequency of the mode, we obtain the geometric shunt impedance (R/Q) of 
that mode. The longitudinal wakefield induced in a single mode in the narrowband region 
by a single bunch is given by 


W, = 2k cos(wt)e™“*/ 22r, (7.36) 
Due to superposition we can simply sum the wake from each mode (with index m) together 
to obtain the total wakefield 
W, = Ñ 2km cos(wmt)e “ere, (7.37) 
m=1 


It can be seen that the longitudinal wake is given as a sum of damped cosine waves, each 
with different amplitude and frequency, which are all initially in phase at t = 0. 
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7.3.3 Transverse Wakes 


As well as accelerating beams, wakefields can also deflect bunches. Dipole modes have 
transverse electric and magnetic fields which can kick the beam if excited. In order to excite 
a dipole mode there must be an energy exchange between the bunch and the dipole mode, 
which requires electrons to be decelerated by a longitudinal electric field. TE dipole modes 
do not have longitudinal electric fields hence they cannot be excited by the beam. The 
longitudinal electric field of a TM dipole mode is given by 


E, = By (=) cos(¢) cos (pr) eit (7.38) 


where Cmn is the n-th root of the m-th Bessel function (in this case the 1st Bessel function 
Jı), Lis the cavity length, and a is the cavity radius. A beam travelling along the central axis 
(r = 0) will not excite a dipole mode, but a beam that is offset will excite a dipole wakefield, 
referred to as the transverse wake. Once excited by an offset bunch, future bunches will be 
deflected even if travelling along the central axis. It is also possible to excite a transverse 
wake if the cavity is asymmetric due to couplers, or asymmetric cavity geometries which 
cause the dipole mode to gain a longitudinal electric field at r = 0. The transverse Lorentz 
force in the direction of the x-axis, Fy, is given by 


F; = e(Ez +vBy), (7.39) 


where v is the beam velocity. If the beam is highly relativistic then the transverse movement 
is small over the cavity length, L, and hence the transverse momentum change, Ap,, is given 
by 


L 
Apr = ef (Ez + cBy)dz. (7.40) 
0 


It is useful to define a transverse voltage, Vz; it is not strictly a voltage as part of the force 
comes from the magnetic field. However the electric and magnetic forces are very similar if 
the beam is very relativistic due to the longitudinal momentum being orders of magnitude 
greater than the transverse momentum. 


L 
V, = f (Ey + cBy)dz (7.41) 
0 


and A 
Vy = | (E; + cB,)dz. (7.42) 
0 


Applying Maxwell’s equations to these equations and integrating along the cavity with 
the limits in the zero field region, we can derive a relation between the transverse and 
longitudinal voltage of a dipole mode 


Vi= s ViN (7.43) 


where V, is a vector sum of Vz and Vy. By applying this to the equation for the longitudinal 
wakefield, we can derive the transverse wakefield, W1, as 


= So sin(wt)e7™ “t/r, (7.44) 
The effect of the dipole wake is to deflect the beam and the beam offset downstream is 
given by 


WL 


V 
Ag = Maz, (7.45) 
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where Ma2 is the transport matrix element (described in Chapter 5), relating the beam offset 
at a point downstream to the divergence at the cavity and K is the beam’s kinetic energy. 
The transverse wake is more sensitive to aperture size than the longitudinal wakefield. The 
CLIC-G 12 GHz structure has a peak transverse wakefield of —250 V/pC/mm/m which is 
damped to —2 V/pC/mm/m for a particle 0.15 m behind the driving bunch. 

The excitation of a dipole-mode wakefield in a cavity is commonly used for measuring 
beam position in accelerators, as the wakefield is proportional to the beam offset. This can 
be performed either by using wakefield monitors in accelerating RF cavities or by making 
special cavity beam position monitors as bespoke devices specifically for measuring the 
beam position. Dipole modes are also used as deflecting cavities in accelerators as bunch 
separators, or to give the head and tail equal and opposite kicks to rotate a bunch as a 
diagnostic or to align a bunch for collision at a crossing angle (known as a crab cavity). 


7.3.4 Multiple Bunches 


As well as using superposition to sum the wake from each mode, we can also use super- 
position to sum the wakes from multiple bunches. We must include the energy lost by an 
electron bunch due to its own wake. As the wake grows in time as the beam passes the 
finite cavity length, on average the deceleration voltage experienced by a bunch is half of 
the wakefield voltage it induces, known as the fundamental theorem of beam loading. The 
value of half can be derived by considering energy conservation of the deceleration of an 
electron bunch by a cavity voltage, and considering both the energy gained by the cavity 
and the energy lost by the beam [32]. Taking this into account the total wakefield, W,, over 
several modes, m, and N bunches is given by [33] 


foe) N 
W.=>) (in +Y km cninnrjsone 2 (7.46) 


m=1 n=1 


for the longitudinal wake, where 7 is the bunch spacing in time and is equal to the reciprocal 
of the bunch repetition frequency, frep, (T = 1/frep). While for the transverse wake, W1, 
it is 


W, = 5 5 =—— sin(wmnr)e 2mnt/2Qrm (7.47) 


The longitudinal wake is given as a sum of damped cosine waves above, which converges 
to a finite value — known as the sum wake — after the wake from the first bunch decays. If a 
given mode has a resonant frequency at a harmonic of the bunch repetition frequency, the 
wakes have constructive interference driving a larger wake, while if they are a half integer 
the wake cancels every second mode. The harmonics of the bunch repetition frequency are 
known as machine lines, and special care must be taken in designing an accelerator with 
modes close to them. For example, in ESS the specification requires no HOM is within 
3 MHz of a machine line. The higher the Q factor of a HOM, the higher the sum wake as 
the field excited may remain in the cavity for several times the bunch seperation. To reduce 
the sum wake it is usually necessary in multi-bunch machines to damp the HOMs using 
HOM couplers as discussed in Chapter 3 

For the transverse wake, the sum wake is the sum of damped sine waves. In this case 
the maximum wakefield isn’t at a machine line, as while this will drive a large dipole field 
in the cavity, the transverse force is zero as the transverse and longitudinal wakes are 90 
degrees out of phase. Instead, the largest wake occurs at a frequency slightly off the machine 
line, with the exact frequency depending on the Q factor of the dominant dipole mode. If 
we take an example of a bunch separation of 10 ns and a wake dominated by three modes 
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FIGURE 7.4 The total wakefield for each bunch in a train of 183 bunches, for a bunch separation of 
10 ns and a wake dominated by three modes at frequencies of 3.050 GHz, 4.000 GHz and 8.035 GHz, with 
(R/Q) of 120, 20 and 100 Q and loaded Q factors of 50, 1000 and 10,000 respectively. 


at frequencies of 3.050 GHz, 4.000 GHz and 8.035 GHz, with (R/Q) of 60, 10 and 50 Q 
and loaded Q factors of 50, 1000 and 10,000 respectively, we get the total wake shown in 
Fig 7.4. The first mode at 3.050 GHz is strongly damped, so it provides almost the same 
wake to every bunch. The second mode at 4.000 GHz is at a machine line and hence has a 
wake which increases every bunch over the first 50 bunches but then remains constant; this 
mode has the largest effect on the sum wake, even though it has the lowest impedance. The 
third mode is not at a bunch harmonic and hence the wake oscillates every bunch. Hence 
for very short bunch trains we are most concerned with modes with high R/Q, but a mode 
that is dominant over short timescales may not be dominant over longer timescales, hence 
for longer bunch trains we are most concerned with modes with high Q factors that are 
close to machine lines. 


7.3.5 Wakefield-Driven Instabilities 


Wakefields/impedances lead to energy spread and emittance growth in accelerators, but 
also cause instabilities that drive the beam offset and can cause beam loss. As we have 
seen, the transverse wake is zero if the driving charge is on the beam axis, but increases 
linearly with beam offset. The force on a trailing particle, where a wake has previously been 
induced, is independent of offset to first order. This leads to two feedback mechanisms, 
known as beam-breakup instabilities (BBU). The first is cumulative BBU, where an offset 
bunch at the start of a linac will cause subsequent bunches to be deflected, independent 
of their offset. These subsequent bunches will enter the next cavity at an offset, driving a 
larger wake which will in turn deflect the following bunches; this induces still larger wakes 
in the third cavity, and so on. In long linacs, this can lead to significant beam loss at the 
end of the linac. The second feedback mechanism occurs in energy recovery linacs (ERLs), 
and is known as regenerative BBU. In ERLs each bunch will have one or more accelerating 
passes of a linac, before passing the same cavity again for an equal number of decelerating 
passes before being dumped. This means the beam loading will cancel out between bunches 
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being accelerated and decelerated, allowing very high beam currents. The beam current 
in an ERL is instead limited by regenerative BBU. Here an offset bunch again deflects 
subsequent bunches, which then return to the same cavity with an offset. If the offset bunch 
drives a wake such that the next bunch will return to the cavity with a larger offset than 
the first, and hence the wakefield amplitude grows, then the bunch offset will increase with 
each subsequent bunch until beam loss occurs. The BBU start current, I, — where the offset 
starts to grow — for each mode is given by [34] 


2c? 
e(R/Q)Qrw M2 sin (wt,)’ 


where t, is the revolution time, i.e. the time it takes an electron bunch to make a single loop 
of the ERL and return to the same cavity. The regenerative BBU start current is the lowest 
start current for all modes. It can be seen that the start current is inversely proportional to 
sinwt, hence for the highest impedance modes we can design the revolution time to be a 
harmonic of the RF frequency to increase the start current for that mode. The start current 
can also be increased with careful design of Mj2 or by using strong HOM damping. 

Short-range wakefields are also an issue in linacs, with the head of an offset bunch 
driving a wake which deflects the tail creating so-called banana bunches due to their curved 
transverse profile. This effect can be reduced by an approach known as Balakin, Novokhatsky 
and Smirnov (BNS) damping after the originators [35]; they suggested that if the head and 
tail had different kinetic energies then their betatron motion would be different for the head 
and the tail causing them to oscillate in position and transverse momentum at different 
frequencies. As such, as the tail oscillates back and forward, it would be out of phase with 
the wake causing cancellation over the length of the linac. 

In circular machines the transverse wakefield leads to tune shifts, which if sufficiently 
large can lead to transverse instabilities as discussed in Chapter 5. Like with space charge 
these lead to tune spreads which therefore cannot be compensated with quadrupoles. We 
can also use octupoles to stabilise the beam against external excitations, inducing what is 
known as Landau damping; for more details see the discussion in Lee [36]. 


lst = 


(7.48) 


7.4 Coherent Synchrotron Radiation (CSR) 


In chapter 6 we looked at synchrotron radiation emitted from moving charges in accelerators. 
We saw that this radiation depends linearly on the number of charges in a bunch, i.e. the 
emitted power P x Nz. However, looking again at the Larmor formula, 


qea2y4 

P= rga’ (7.49) 
we note the factor q? in the numerator; this would imply that two charges (say, electrons) 
in very close proximity should radiate with a power œ q? rather than œ 2q — i.e. they should 
radiate coherently. This is in fact true, and charges do radiate coherently if they are close 
enough together. In the case of synchrotron radiation, a bunch of electrons emits either 
incoherent synchrotron radiation (ISR) or coherent synchrotron radiation (CSR) depend- 
ing upon the separation of the electrons; we expect coherent radiation for those emitted 
wavelengths A which are comparable to the size of the electron bunch Gg, y, as described by 
Schiff in 1946 [37] and first observed in 1989 [38]. Since the number of electrons in a bunch 
can be quite large — perhaps Mp ~ 101° to 101! — the coherent enhancement of the radiation 
power can be huge. We should distinguish here the (coherent) enhancement of the radiated 
power — due to the proximity of the electrons to each other — from the coherence of the 
photon output. Incoherently-emitted photons at wavelength A may be coherent with each 
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other, if the source size of the electrons is small. The condition for photon coherence is that 
the emittance in each plane 


sy < in (7.50) 


such an (ISR) source is known as diffraction-limited, but even for emittances € ~ /47 or 
larger there will be some partial coherence of the emitted radiation. 

To illustrate the power enhancement due to CSR, we consider an electron bunch with 
total charge Q = 1 nC circulating with kinetic energy 3 GeV in the Diamond electron storage 
ring, choosing a rather short buch duration of 1 ps (in other words, the electrons are grouped 
together in a length l = ct ~ 0.3 mm); these are realistic values for the length and charge’. 
As we saw in Chapter 6, the total incoherent power radiated by each electron from the 
dipoles is 86 nW (over all wavelengths), and therefore the total incoherent power radiated by 
the electron bunch is 537 W - already a large value. However, wavelengths A > 0.3 mm (i.e. 
in the microwave part of the spectrum) radiate coherently, and so there is an enhancement 
of the radiated power by a factor of Q/e ~ 6 x 10° (although emission at wavelengths larger 
than the size of the vacuum vessel are suppressed); the coherent enhancement in the power 
emitted at those wavelength is absolutely enormous. Remembering that relativistic electrons 
already see a power enhancement œ 74, we see that coherent radiation from relativistically- 
moving charges can generate huge numbers of photons at long wavelengths since we can 
produce significant numbers of electrons in a small bunch. This enhancement is shown in 
Fig 7.5. 
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FIGURE 7.5 The solid line shows the incoherent synchrotron radiation power for an electron bunch 
charge of 1 nC, i.e. a number of electrons Ny ~ 6.24 x 10%; the vertical scale is the radiated power 
compared to that radiated power of 1 electron at the critical frequency We = 1.28 x 1019 s7! (8.4 keV 
photons). The dotted line shows the coherent enhancement up to another factor Np that occurs for long- 
wavelength (low-frequency) photons for an electron bunch duration of 1 ps; NV, a is an enormous factor even 


for modest bunch charges of 1 nC. 


* The transverse size of the bunch is much smaller than the length. 
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A complete treatment of CSR is complex and we refer the reader to many excellent works 
in the literature [39, 40, 41]. The analysis begins with the Lienard-Wiechert interaction 
between two electrons; we consider a ‘steady-state’ regime in which all electrons within a 
bunch experience a constant magnetic field, for example as found in the middle region of 
a bending magnet. The complete calculation for two electrons can be found in Saldin [39] 
and gives the power emitted in terms of the charge and the separation. The extension to 
a larger number of particles requires some approximation; a common one is to assume a 
one-dimensional charge distribution (i.e. all electrons follow the same orbit). The validity 
of such 1D CSR models is typically given in terms of the Derbenev criterion [42], stated in 
terms of bending radius p and the bunch size g, as 

Oz 
(po2) 178 <1; (7.51) 


this must be satisfied for the 1D model to be valid in the horizontal plane. A similar condition 
can be made in the vertical plane. For practical calculations of CSR numerical methods are 
generally employed, for example using elegant [40] or GPT [7, 8, 41]. A recent paper by 
Brynes et al. gives a useful review [41]. 

One practical consequence of CSR is that the emitted radiation (in a dipole) gradually 
overtakes the electrons themselves, and produces an effective wakefield; in contrast to the 
conventional wakefields described above, the CSR wake occurs in front of the emitting 
electrons rather than behind. CSR emission causes an average loss in energy for any given 
electron, but electrons towards the front of a bunch are generally accelerated with respect 
to those at the rear, which are relatively decelerated. This in turn can result in a number 
of phenomena that may degrade the quality of the electron bunch [43]. One consequence 
is that the overall emittance may be increased [44], but may also be compensated using a 
suitable beam-optics arrangement [45]. Another is that CSR can give rise to a longitudinal 
modulation of the electron density over distances shorter than the bunch length, a so- 
called microbunching; this is particularly important in free-electron laser (FEL) design, 
where unwanted microbunching may interfere with the similar process for FEL gain. The 
microbunching instability was described by Heifets et al. [46] with useful formulae given by 
Huang and Kim [47]. 

The FEL itself is the premier source of coherent radiation from undulators. They are 
able to generate tuneable laser-like output and can operate up to X-ray wavelengths. An 
excellent review of the physics of FELs and their output properties is given in [48]. 


Exercises 


1. Estimate the Debye length for the following situations. i) a typical fluorescent lamp 
plasma, in which the electron density is around 1016 m~? and the electron temperature 
around 1 eV, ii) a tokamak plasma for which the (fully-ionised) plasma density is around 
107° m~? and the ion/electron temperature ~10 keV, iii) a low-energy electron bunch 
with transverse size 0.1 mm, length 10 ps and a charge of 100 pC. 


2. Estimate the space-charge-induced tune shift in a proton ring in which a smooth 1 A 
current of 10 MeV protons moves in a radius of 1 metre. You may take the emittance in 
each plane to be 10 mm-mrad and the (-function to be 1 metre. 


3. The ALICE energy-recovery linac circulated electron bunches with a charge of 80 pC, 
a normalised emittance (discussed in chapter 5) of around 3 mm-mrad and a typical 
bunch length of 4 ps. Determine for what energy range the bunches will experience signif- 
icant space charge; is space charge important for the ALICE energy range from 350 keV 
(injector) to 35 MeV (FEL operation)? 


Multi-Particle Motion 301 


10. 


The 6 GeV PETRA-III storage ring at DESY has a very small natural emittance of 1.3 nm- 
rad and operates with a small coupling around 0.6%. Making sensible assumptions about 
the 6-functions and bunch length, estimate the Touschek lifetime for a stored current of 
100 mA and a momentum acceptance of 1.6%. Is the Touschek lifetime significant when 
compared to a beam-gas scattering lifetime of 14 hours? 


The dominant higher-order mode in a cavity, the TMo29 mode, has a resonant frequency 
of 2 GHz, a geometric shunt impedance of 100 Q and a loaded Q factor of 50. Calculate 
the frequency shift, and phase shift at t = 0 between the resonant frequency of the cavity 
and the frequency of the wake caused by the low Q factor. 


For the same cavity a continuous train of bunches of 10 nC charge, separated in time by 
by 15 ns, traverses the cavity. Calculate the sum wake for the TMo29 mode, assuming the 
bunches are very short compared to the mode’s wavelength. 


If the same cavity is placed in an ERL, with Mı2 = 1 and a revolution time of 500 ns, at 
what current does regenerative BBU begin? 


What is the diffraction-limited emittance for 8 keV photon emission? 


The 6 GeV European Synchrotron Radiation Facility has recently been upgraded from 
a natural emittance of 4 nm-rad to an emittance of 0.13 nm-rad. Making sensible as- 
sumptions about the insertion device field and coupling, determine whether the ESRF 
output is diffraction-limited either with its old or new design. You may assume a vertical 
emittance of 10 pm-rad in both configurations. 


Using the same ALICE parameters as in the problem given earlier and assuming a dipole 
field of 0.2 T, determine for what wavelengths you would expect CSR to be important 
and how much enhancement of the power there would be. What is the Derbenev criterion 
in this situation? 
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beam frequency spectrum, 202 
beam loading, 77 
beam rigidity, 16 
beam-break-up instability, 293 
beamline, 172 
bending radius, 16 
betatron, 27 
betatron phase, 177 
biperiodic cavity, 70 
boost converter, 32 
BPM, 220 
bremsstrahlung, 266 

power, 269 

spectrum, 272 
brightness 

radiation, 247 
bucket (RF), 210 
bunch, 25 


C-dipole, 124 
caesium telluride (Cs2Te), 35 
cavity, 7 
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chirp, 215 
chromaticity, 192, 200, 202 
circular accelerator, 2 
closed-orbit distortion, 188 
co-moving co-ordinate system, 166 
coaxial lines, 51 
Cockcroft-Walton voltage multiplier, 32 
combined function dipole magnet, 112 
combined function magnet, 116 
Compton scattering, 256 
constant-gradient structure, 76 
constant-impedance structure, 76 
control system, 9 
Coulomb logarithm, 284 
Courant-Snyder formalism, 176 
critical energy, 240 
wiggler, 250 
critical frequency, 240, 245 
critically-coupled cavity, 60 
cross section, 254 
crossbar H-mode structure, 99 
cryogenics, 84 
current density in a coil, 128 
cyclotron, 4, 26 
Dees, 27 
cyclotron frequency, 26, 233 
cyclotron radiation, 233 
polarisation, 234 
emitted frequency, 234 
polarisation, 245 


damping partition number, 264 
damping ring, 266 
damping time, 263 
DBA, 198 
de-ionised water; use in magnet coils, 128 
Debye length, 280 
demineralised water; use in magnet coils, 128 
diagnostics, 8 
dielectric acceleration, 107 
dipole, 4 
dipole electromagnet, 121 
dipole errors, 188 
dipole magnet, 112 
pole shape, 113 
dipole radiation, 244 
disk-loaded cavity, 40 
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disk-loaded waveguide, 73 
dispersion, 192, 195 
double-bend achromat (DBA), 198 
doublet, 175 
drift space, 171 
drift-tube linac, 39, 97 

Alvarez linac, 99 

Widerge linac, 99 
Duane-Hunt law, 268, 271 
dynamic aperture, 265 


eddy currents, 129 
electromagnetic field energy, 17 
electromagnetic plane waves, 19 
electromagnetic radiation, 219 
average power, 229 
bremsstrahlung, 266 
coherent emission, 236 
Compton scattering, 256 
critical energy, 240 
damping, 262 
from an accelerated charge, 221 
from moving charges, 233 
inverse Compton scattering, 258 
polarisation, 245 
radiated power, 223 
radiation resistance, 228 
scattering, 253 
synchrotron radiation, 236 
synchrotron radiation power, 236 
Thomson scattering, 255 
electromagnetism, 4, 11, 12 
electromagnets, 120 
electron emission, 34 
electron guns, 92 
Fowler-Nordheim law, 36 
photocathode, 35 
space-charge-limited emission, 35 
temperature-limited emission, 34 
electron gun, 2 
electron-positron colliders, 28 
electron-volt, 25 
electrostatic acceleration, 32 
ellipse, particle, 178 
emission spectrum, 237 
emittance, 177, 178, 203 
equilibrium emittance, 265 
growth, 282 
emittance compensation, 283 
emittance growth, 264 
emittance ratio, 266 


EMW, 250 
energy 

of a particle, 15 

of an electromagnetic field, 17 
energy acceptance, 265, 285 
energy loss per turn, 241, 243 
energy per nucleon, 29 
energy spread, 264 
envelope equation, 177 

space charge, 282 
equivalent circuit, 58, 89 


F-quadrupole/D-quadrupole, 171 
far-field radiation, 227 
Faraday’s law, 134 
ferrite-loaded cavities, 97 
field emission, 63, 83 
field errors in a magnet, 119 
fields around a moving charge, 220 
flux pinning, 81 
FODO, 183, 185 

stability, 184 
Force on a charged particle, 14 
Frank-Tamm formula, 274 
free-electron laser, 2, 282, 296 
free-free emission, 271 
free-space impedance, 228 
frequency spectrum, 239, 243 
fundamental power coupler, 86 


gallium arsenide (GaAs), 35 
geometry factor, 53 
good field region of a magnet, 124 
gradient, 7 

acceleration, 54 
gradient dipole 

pole shape, 116 
gradient dipole magnet, 112 
group velocity, 47 
growth rate, 284 
gyration, 26, 233 


H-dipole, 121 

Hamilton’s equations, 165 
harmonic, 235 

harmonic number, 27, 208, 251 
heavy-ion acceleration, 28 
Hertzian dipole, 225 
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higher-order mode (HOM) couplers, 88, 292 


higher-order modes, 88 
Hill’s equation, 166, 168 
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hoop stress, 156 


ICS, 258 
impact parameter, 267 
in-vacuum undulators, 159 


incoherent synchrotron radiation, 243 


incoherent tune shift, 281 
induction acceleration, 27 
insertion device, 247, 248 


interdigital H-mode structure, 99 
intrabeam scattering (IBS), 283 
inverse Compton scattering, 258 


flux, 261 

power, 260 
ion source, 2, 28 
ion sources, 37 
IOT, 103 
isochronous, 26 
ISR, 243, 244 


K parameter, 249 


Kapchinksy-Vladimirsky (KV) distribution, 
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Kilpatrick criterion, 37 
Kilpatrick limit, 63 
klystron, 40, 103 


laminar beam, 282 
laminated steel yokes, 129 
Laplace equation, 112 
Larmor formula, 225, 258 
Larmor frequency, 233 
lattice, 7 
lattice design, 183, 186 
lattice functions, 177, 182 
LHC, 162, 202 
linac, 2 
linear accelerator, 1 
linear matrix formalism, 170 
lines of flux, 113 
Liouville’s theorem, 186 
Lorentz equation, 4, 167 
Lorentz factor y, 15 
Lorentz force, 14 

work done, 15 
loss factor, 289 


MAD-X, 181 

magnet 
AC, 120, 134 
AC losses in steel, 136 
allowed errors, 119 


coil dominated, 153 
coils, 128 
combined function, 112 
cooling of coils, 128 
cos-theta magnets, 153 
current density, 128 
degaussing, 136 
dipole, 112, 121 

C type, 124 

H type, 121 

permanent magnet, 143 
eddy currents, 129, 134, 137 
electromagnets, 120 
families, 118 
ferrites, 137 
field errors, 119 
flux lines, 113 
forces, 130 
good field region, 124 
gradient dipole, 112 
hysteresis, 134, 136 
hysteresis loop, 136 
hysteresis losses, 137 
inductance, 130 
kickers, 136 
laminations, 129 


longitudinal pole termination, 129 


multipole definition, 118 
multipoles, 112 

normal, 119 

normal conducting, 120 


permanent magnet dipole, 143 
permanent magnet quadrupole, 146 


permanent magnets, 138 
polarity checks, 132 
procurement, 132 
pulsed, 120 
quadrupole, 112 
normal conducting, 125 
permanent magnet, 146 
reliability, 131 
septum magnets, 138 
sextupole, 112 
shimming, 124, 127 
skew, 119 
solid steel yoke, 129 
static, 120 
steel permeability, 124 
steel yoke, 128 
stored energy, 130 
superconducting, 4, 149, 248 
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superconducting materials, 151 dipole, 143 

superferric magnets, 156 adjustable, 145 

water cooling of coils, 128, 131 Halbach quadrupoles, 146 
magnet family, 112 load line, 144 
magnetron, 40 materials, 140 
magnetrons, 105 quadrupole, 146 
map, 175 adjustable, 147 

one-turn, 181 radiation damage, 142, 158 
Marx bank, 33 remanent field, 140 
matrix equations, 170 temperature effects, 140 
Maxwell’s equations, 12, 112, 121 undulators, 147 
MBA, 265 working point, 144 
microphonics, 84 permanent magnets, 138 
mini-@ principle, 186 permeability of steel, 124 
misalignments, 266 perveance, 35, 282 
momentum acceptance, 265 phase advance, 177, 182 
momentum compaction, 198 RF, 68 
momentum compaction factor, 200, 265 phase stability, 26, 207 
momentum of a particle, 15 phase velocity, 47 
monochromator, 252 photocathode, 35 
MPW, 250, 266 photoelectric effect, 35 
multi-charge state acceleration, 29 photon emission, 241, 245 
multibend achromat, 265 Pierce geometry, 36 
multipactor, 64, 92 plasma, 25 
multipole, 250 plasma acceleration, 7, 107 
multipole magnets, 112 plasma bremsstrahlung, 271 
multipole wiggler, 250, 266 plasma cooling, 271 
muon acceleration, 29 PMW, 250 
muon colliders, 29 Poisson’s equation, 283 

polarisation, 239, 245 

natural emittance, 265 pole shape, 113, 114, 116 
near-field radiation, 227 positron accelerators, 28 
neodymium ion boron (NdFeB), 250 positron source, 28 
non-linear motion, 215 Poynting flux, 231 
normal conducting magnets, 120 Poynting vector, 18, 48, 222, 223, 244, 254 
normal conducting quadrupole, 125 modified Poynting vector, 64 
normal magnets, 119 Poynting’s theorem, 19 
nose cones, 56 PPM, 250 


proton source, 28 
pulsed heating, 66 
pulsed magnets, 120, 136 


over-coupled cavity, 60 
overvoltage, 265 


particle accelerator, 1 Q factor, 53, 58 
particle beam, 25 
particle bunch, 7 
particle motion in EM fields, 16 
Penning source, 37 
permanent magnet 

aging, 142 

coercivity, 140 

cryogenic, 140 


quadrupole, 2, 4, 171 
integrated strength, 172 
pole shape, 114 

quadrupole errors, 190 

quadrupole magnet, 112 

quantum constant, 263 

quarter-wave antenna, 228 

quarter-wave cavities, 94 
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radiation 
from a moving charge, 233 
from a synchrotron, 236 
radiation damping, 262 
IBS, 283 
radiation integrals, 262 
radiation pressure, 23 
radiation spectrum, 237 
radio-frequency quadrupoles (RFQs), 101 
radiotherapy, 267 
Rayleigh scattering, 255 
reference orbit/trajectory, 188 
reflected power, 60 
refractive index, 19 
relativistic factor y, 15 
relativity, 4, 15 
residual resistance, 81 
resonance, 7, 189, 192 
rest mass and energy, 15 
retarded time, 222 
RF acceptance, 265 
RF breakdown, 63 
RF cavity, 40, 41, 48 
multi-cell cavities, 67 
standing-wave, 69 
travelling-wave, 72 
RF coupler, 85 
RF power loss, 53, 75 
RF windows, 87 
Robinson sum rule, 264 


samarium cobalt (SmCO), 250 
scalar potential, 113 
scattering 

beam, 283 
separatrix, 210 
sextupole 

pole shape, 116 
sextupole magnet, 112 
sextupoles, 202 
shimming a dipole magnet, 124 
shimming a quadrupole magnet, 127 
shunt impedance, 55, 56 
side-coupled cavity, 70 
simple harmonic motion, 164 
skew magnets, 119 
solid steel yokes, 129 
solid-state power amplifiers, 105 
space charge, 36, 280 
special relativity, 15, 23 
spectral power, 245 
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spectrum, 237 
spoke cavities, 93 
stability, 162 
standing-wave structure, 69 
static magnetic fields, 120 
steel yoke of magnet, 128 
storage ring, 2, 263 
current, 242 
stripping of H7, 28 
strong focusing, 27 
sum wake, 292 
superconducting magnets, 4, 149 
forces on coils, 156 
hoop stress, 156 
inclusion of steel, 153 
quenches, 156 
radiation damage, 158 
training, 157 
undulators, 157 
superconducting materials, 151 
engineering current density, 152 
Rutherford cable, 152 
wire, 152 
wire filaments, 152 
superconducting undulators, 157 
in-vacuum, 159 
radiation damage, 158 
superconductivity 
BCS resistance, 79 
type-I superconductors, 82 
type-II superconductors, 82 
critical magnetic field, 82 
superconducting RF, 78 
synchrocyclotron, 26 
synchronous phase, 265 
synchrotron, 27 
synchrotron frequency, 265 
synchrotron radiation, 236 
angular distribution, 244 
brightness, 247 
critical energy, 240 
dipole radiation, 247 
emittance growth, 264 
energy loss per turn, 241, 243, 263 
incoherent, 243 
induced energy spread, 264 
mean energy, 240 
number of photons, 241 
photon flux, 245 
polarisation, 239, 245 
power, 236, 243 
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scattering picture, 255 
source, 242, 247 

sources, 238 

spectral intensity, 246 
spectrum, 237, 243, 245 
undulator radiation, 251 
wavelength shifter, 247 
wiggler radiation, 248, 250 


target, 267 

temperature, 280 

tetrode, 103 

thermal beam, 282 

thin-lens approximation, 172 
Thomson scattering, 255 
tokamak, 271 

Touschek scattering, 265, 285 
tracking, 176 

transfer matrix, 172, 182 
transit-time factor, 54 
travelling-wave structure, 72 
triode, 103 

tune, 182, 200 

tune spread, 281 


under-coupled cavity, 60 
undulator, 251 
permanent magnet, 147 
undulator equation, 251 
undulators 
permanent magnet, 138 
superconducting, 157 


vector potential, 113 


wakefelds 
long-range, 88 
wakefield, 58 
wakefields, 77, 88, 286 
geometric, 287 
instabilities, 293 
long-range, 290 
resistive wall, 287 
short-range, 289 
transverse, 290 
wave equation, 19 
waveguide, 41 
wavelength shifter, 247 
Widerøe linac, 39 
wiggler, 248 
K parameter, 249 
radiation, 250 


work done by electromagnetic field, 17 


X-ray diffraction, 247 
X-rays, 267 
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